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L  ABSTRACT 

Objective:  To  evaluate  the  accuracy  of  the  Acute  Physiology  and  Chronic 
Health  Evaluation  (APACHE)  II  prognostic  scoring  system  when  applied  to 
human  immunodeficiency  virus  (HIV)  seropositive  patients  (with  or 
without  the  acquired  immunodeficiency  syndrome  (AIDS))  in  the  Medical 
Intensive  Care  Unit  (MICU). 

Methods:  The  medical  records  of  all  HIV-positive  patients  who  were 
discharged  from  the  Yale-New  Haven  Hospital  MICU  between  October  1,  1986 
and  September  30,  1991  were  retrospectively  reviewed  and  clinical  and 
laboratory  data,  including  variables  needed  for  APACHE  II  scoring,  were 
collected.  Patients  were  assigned  to  disease  categories  using  pre-determined 
algorithms  when  documented  clinical  decisionmaking  was  unclear. 

Results:  Records  from  161  separate  MICU  discharges  met  the  criteria  for  the 
study.  APACHE  II  greatly  underestimated  mortality  among  patients  with 
pneumonia  (n=44)  (28.5%  estimated  (E)  vs.  50.0%  observed  (O),  p  <  .005).  In 
the  group  of  patients  with  pneumonia,  APACHE  II  accurately  predicted 
mortality  in  patients  with  Pneumocystis  cnrinii  pneumonia  (n=26)  (28.2%  (E) 
vs.  34.6%  (O))  but  underestimated  mortality  in  patients  with  pneumonia  of 
other  or  unknown  etiology  (n=18)  (28.9%  (E)  vs.  72.2%  (O),  p  <  .005).  In  all 
other  patients  (patients  without  pneumonia)  (n=117),  APACHE  II  correctly 
estimated  mortality  (38.1%  (E)  vs.  41.9%  (O')).  In  patients  admitted  to  the 
MICU  with  a  diagnosis  of  infection  (pneumonia  or  sepsis),  a  total  lymphocyte 
count  (TLC)  <  200  was  a  strong  predictor  of  mortality.  In  all  other  patients 
(without  pneumonia  or  sepsis),  TLC  was  not  associated  with  outcome. 
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Conclusion:  The  APACHE  II  prognostic  scoring  system  is  an  effective  method 
of  stratifying  a  general  HIV-positive  MICU  patient  population  in  respect  to  in- 
hospital  mortality,  but  it  is  not  accurate  when  applied  to  a  significant 
proportion  of  our  population,  HIV-positive  patients  with  pneumonia, 
particularly  non-Pneumocystis  carinii  pneumonia.  The  reason  for  APACHE 
II's  underestimation  of  mortality  is  multifactorial.  It  is  likely  to  be  due,  at 
least  partially,  to  (1)  absence  of  adequate  risk  adjustment  for  severity  of 
immunologic  suppression;  (2)  lack  of  precisely  defined  decisionmaking 
algorithms  to  aid  in  the  assignment  of  disease  categories;  and  (3)  a  significant 
proportion  of  HIV-positive  patients  with  pneumonia  of  unknown  etiology 
who  may  be  uniquely  resistant  to  treatment.  The  use  of  TLC  as  an  additional 
risk  factor  may  improve  predictive  accuracy  of  ICU  risk  adjustment  models 
for  use  in  HIV-positive  patients  with  diagnoses  of  infection. 

IL  INTRODUCTION 


Beginning  in  the  mid-1980's,  the  human  immunodeficiency  virus 
(HIV)  and  the  acquired  immunodeficiency  syndrome  (AIDS)  have  had  an 
enormous  impact  on  the  American  health  care  system  and  particularly  on  the 
field  of  intensive  care  medicine.  In  1990,  there  were  an  estimated  one  million 
people  infected  with  HIV  in  the  United  States.1  Of  these  one  million  people, 
it  has  been  projected  that  nearly  205,000  will  have  frank  AIDS  in  1993  and 
nearly  246,000  in  1994. 2  Many  of  these  people  with  AIDS  will  need  to  be 
admitted  to  an  intensive  care  unit  at  some  time  during  their  illness.  Using 
an  estimate  that  the  cost  of  treating  all  people  infected  with  HIV  will  increase 
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roughly  20%  per  year,  it  has  been  projected  that  $10.4  billion  will  be  spent 
treating  all  people  infected  with  HIV  in  1994  alone.2 

Because  AIDS  has  spread  so  rapidly  and  has  affected  such  a  great 
proportion  of  the  population,  much  research  has  been  focused  on  elucidating 
the  pathophysiology  of  the  human  immune  system  and  the  role  of 
opportunistic  infections  in  immunocompromised  hosts.  In  addition  to 
promoting  basic  science  research,  HIV  infection  and  the  "AIDS  epidemic" 
have  led  to  greater  discussion  about  the  ethical  dilemmas  surrounding 
quality  of  life  decisions,3'14  and  the  aggressive  medical  treatment  of  patients 
with  serious  and/or  terminal  illnesses.15'13  The  healthcare  system,  and 
society  as  a  whole,  is  faced  with  difficult  and  interconnected  ethical  and 
financial  decisions.  Physicians  are  being  forced  to  do  cost-benefit  analyses  in 
addition  to  making  the  medical  decisions  for  which  they  were  trained.  The 
sort  of  decisions  being  made  are  variations  on  one  simple  question:  "Is  the 
result  worth  the  cost?"  In  other  words,  do  the  medical  outcomes  justify  the 
financial  resources  being  spent? 

This  question  must  be  considered  in  all  medical  settings  and  involving 
all  types  of  medical/financial  decisions,  ranging  from  state-funded  childhood 
immunization  programs  to  organ  transplant  surgery.  To  answer  these 
questions,  patient  outcomes  must  be  assessed  and  analyzed.  Until  recently, 
however,  the  means  to  do  this  analysis  have  been  lacking.  Although  the  raw 
data  (mortality  rates,  nosocomial  infection  rates,  surgical  complication  rates, 
etc.)  have  been  available,  medical  outcome  analysis  had  proved  to  be  difficult 
to  analyze  since  medical  outcomes  are  dependent  on  a  wide  array  of  variables. 
Medical  outcomes  are  dependent  not  only  upon  the  quality  of  medical  care 
but  also  upon  other  factors,  including  comorbid  illnesses,  access  to  care, 
patients'  extent  of  acute  illness,  and  etiology  of  disease. 
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One  area  of  medicine  in  which  cost-benefit  discussions  and  medical 
outcome  analysis  has  become  common  has  been  in  the  intensive  care  setting. 
In  intensive  care  units  (ICU's),  prognostic  stratification  systems  have  been 
developed  to  better  analyze  medical  outcomes.  The  prognostic  stratification 
systems  adjust  for  the  variability  of  patient  "case  mix."  In  other  words,  these 
systems  are  means  of  performing  risk  adjustment  related  to  medical 
outcomes.  By  accounting  for  varying  degrees  of  patient  "risk,"  these 
prognostic  systems  allow  more  accurate  outcome  comparisons  to  be  made  and 
facilitate  cost-benefit  analyses. 

The  most  widely  accepted  system  of  risk  adjustment  in  the  ICU  setting 
is  the  Acute  Physiology  and  Chronic  Health  Evaluation  (APACHE)  II 
prognostic  scoring  system.  Presently,  however,  APACHE  II  has  not  been 
thoroughly  evaluated  in  patients  who  are  HIV  positive  or  who  have  AIDS. 
Thus,  healthcare  professionals  are  lacking  the  tools  necessary  to  effectively 
analyze  medical  outcomes  in  HIV-positive  patients.  Because  a  reliable 
research  tool  is  needed  for  outcome  analysis  in  the  increasing  numbers  of 
HIV-positive  patients,  an  evaluation  of  APACHE  II's  ability  to  perform  risk 
adjustment  in  HIV-positive  patients  was  undertaken.  Before  discussing  the 
current  study  and  the  recent  literature  pertaining  to  the  application  of 
APACHE  II  to  an  HIV-positive  patients  population,  understanding  the 
pertinent  issues  will  be  enhanced  by  briefly  reviewing  the  field  of  prognostic 
stratification  and  the  development  of  the  original  APACHE  and  APACHE  II 
prognostic  scoring  systems. 


Prognostic  stratification 
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Perhaps  the  easiest  way  to  introduce  the  concept  of  prognostic 
stratification  is  to  consider  the  numerous  ways  in  which  it  is  used  by 
everyone  in  everyday  life,  on  a  minute-to-minute  basis.  When  an 
automobile  driver  accelerates  at  a  green  light,  when  a  baseball  outfielder  shifts 
position  for  a  left-handed  batter,  or  when  a  diner  orders  a  meal,  prognostic 
stratification  is  actively,  though  usually  without  awareness,  being  used.  In 
each  of  these  "routine”  daily  circumstances,  a  decision  is  being  made  based  on 
past  experiences  in  similar  situations.  The  driver  realizes  that  in  the  past 
when  he  has  been  waiting  at  an  intersection  and  the  light  has  turned  green,  it 
has  usually,  if  not  always,  been  safe  to  accelerate  and  cross  the  intersection. 

The  outfielder,  probably  with  greater  awareness  of  his  mental  processes, 
recognizes,  based  on  past  experience  (either  his  own  or  communicated  by 
others),  that  a  left-handed  batter  is  more  likely  to  hit  the  ball  toward  right 
field,  and  consequently,  it  would  be  wise  to  shift  his  field  position  accordingly. 
The  diner,  preparing  for  a  meal,  remembers  his  own  past  dining  experiences 
(or  those  experiences  told  to  him  by  friends),  and  orders  based  on  expectations 
of  gustatory  pleasure.  In  each  of  these  circumstances,  past  experiences  are 
used  to  modify  or  make  decisions  in  the  present  with  an  anticipated  future 
result. 

Despite  being  done  unconsciously  or  with  little  thought,  each  of  these 
hypothetical  decisions  is  not  simply  based  on  vague,  ill-defined  recollections 
and  remembrances.  Each  person  makes  a  decision  based  on  past  experiences 
of  similar  circumstances  or  situations  (a  green  light,  a  left-handed  batter,  or  a 
cheeseburger)  being  associated  with  another  occurrence  (driving  safety,  a  fly 
ball  to  right  field,  or  enjoying  one's  meal).  In  effect,  each  decision  is  based  on 
nothing  more  than  past  associations  and  the  assumption  that  past 
associations  will  remain  true  and  valid  in  the  future.  In  other  words,  each 
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decision  in  "routine"  daily  life  is  based  on  calculated  probabilities  arrived  at 
through  past  associations. 

Thus,  prognostic  stratification  is  the  process  in  which  entities 
(circumstances,  occurrences,  people,  etc.)  are  placed  into  different  strata  based 
upon  past  associations  and  future  prognosis.  This  can  be  restated: 

"Prognostic  stratification  is  the  process  in  which  entities  are  placed  into 
groups  with  similar  predicted  results  or  outcomes." 

Given  the  need  in  clinical  medicine  to  base  testing  and  therapy  on 
evidence  of  improved  outcomes,  it  is  no  suprise  that  prognostic  stratification 
is  used  extensively  in  clinical  medicine,  and  that  medicine  is,  in  fact, 
dependent  upon  prognostic  stratification.  All  medical  decisions  implicitly 
utilize  prognostic  stratification.  When  a  physician  discerns  "baseline  risk," 
considers  different  treatments  based  on  the  "clinical  course"  of  a  disease, 
assigns  cancer  patients  to  different  "stages,"  or  chooses  "appropriate" 
antibiotic  therapy  for  a  presumed  infection,  he  or  she  is  actively  grouping 
(stratifying)  patients  according  to  outcome  prediction  (prognosis). 

As  discussed  by  Feinstein,19  prognostic  stratification  has  three  general 
purposes  in  medicine:  analytic,  predictive,  and  explicative.  In  the  first 
instance,  prognostic  stratification  can  be  used  to  reduce  the  effects  of  a 
population's  heterogeneity  and  "enhance  the  accuracy  and  efficiency  of 
statistical  analysis"11'  in  one  of  two  ways.  Prognostic  stratification  can  allow 
for  the  avoidance  of  either  false  positive  error  or  false  negative  error,  and  it 
can  improve  the  efficiency  of  statistical  comparison  of  population  groups. 
False  positive  errors  occur  when  "statistically  significant"  results  are  found 
which  are  solely  caused  by  the  heterogeneity  of  the  compared  population 
strata,  not  by  true  differences  as  measured  by  the  test  in  question.  Feinstein 
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gives  the  following  example19  of  false  positive  error  and  the  use  of  prognostic 
stratification  in  avoiding  this  type  of  error: 

Suppose  we  found  that  a  group  of  100  men  with  disease  D  had  a  10-year 
survival  rate  of  38%,  whereas  the  rate  was  62%  for  a  group  of  100  women  with 
the  same  disease.  We  might  conclude  that  the  disease  had  distinctly  different 
effects  in  men  and  women. 

Now  suppose,  however,  that  patients  with  this  disease  could  be 
divided  into  a  "good  risk"  prognostic  stratum,  with  80%  10-year  survival,  and  a 
"poor  risk"  stratum  with  20%  10-year  survival.  Suppose  further  that  the  male 
population  contained  30  members  from  the  good-risk  stratum  and  70  members 
from  the  poor-risk  stratum,  but  that  these  distributions  were  exactly  reversed 
in  the  female  population.  We  would  then  find  the  following  results  for  10-year 
survival  rates: 


Men  Women  Total 

Good  Risk  24/30(80%)  56/70  (80%)  80/100(80%) 

Poor  Risk  14/70(20%)  6/30(20%)  20/100(20%) 

Total  38/100(38%)  62/100(62%)  100/200(50%) 


On  comparing  results  within  the  same  prognostic  strata,  we  would  find 
that  the  disease  had  exactly  the  same  effects  in  men  as  in  women:  80% 
survival  for  good  risks,  and  20%  survival  for  poor  risks.  Our  original  conclusion 
about  the  disparate  effects  of  the  disease  in  men  and  women  would  have  been 
wrong  because  of  the  disproportionate  distribution  of  the  good-risk  and  poor- 
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risk  strata  in  the  male  and  female  cohorts.  The  male  cohort  was  composed  of 
70%  poor  risks  and  30%  good  risks,  whereas  the  percentages  were  just  the 
opposite  in  the  cohort  of  women. 


Conversely,  false  negative  errors  occur  when  "statistically  significant"  results 
are  not  found  (or  are  hidden)  due  to  the  heterogeneity  of  the  compared 
population  strata. 

Prognostic  stratification  improves  the  efficiency  of  statistical 
comparison  by  identifying  "polar  strata,"  (those  members  of  a  population 
who  are  either  extremely  likely  or  unlikely  to  attain  the  specific  result  being 
considered).  If  allowed  to  remain  unidentified,  members  of  the  "polar  strata" 
will  mix  in  with  the  total  population,  thereby  diluting  the  test's  results  and 
producing  inefficient  statistical  comparison.  In  addition,  the  identification 
and  separation  of  "polar  strata"  is  especially  important  in  the  testing  and 
analysis  of  therapeutic  claims.  For  example,  in  investigating  a  new  treatment 
for  cancer,  it  is  extremely  difficult  to  prove  that  the  new  treatment  is  either 
effective  or  ineffective.  Oftentimes,  in  order  to  produce  statistically 
significant  results,  inordinately  large  (and  financially  constraining) 
population  sample  sizes  must  be  employed.  In  this  situation,  a  polar  stratum 
is  an  ideal  population  to  use  as  a  test  population  because  a  polar  stratum,  by 
definition,  is  either  extremely  likely  or  unlikely  to  attain  a  specific  result  and, 
therefore,  the  predicted  result  will  have  a  small  calculated  standard  error  and 
a  narrow  confidence  interval  (ranging  between  two  standard  errors  below  and 
above  the  mean).  Consequently,  by  testing  the  new  treatment  on  a  polar 
stratum,  statistical  significance  or  insignificance  can  be  more  easily 
determined. 
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The  second  general  purpose  of  prognostic  stratification  is  predictive. 

The  manner  in  which  prognostic  stratification  can  "enhance  confidence  in 
the  predictive  decisions  of  clinical  practice"19  has  been  mentioned  but 
deserves  to  be  discussed  more  thoroughly.  One  way  in  which  prognostic 
stratification  can  improve  predictive  accuracy  is  by  allowing  clinicians  to  base 
current  prognosis  on  the  outcome  of  past  patients  who  were  in  the  same 
prognostic  stratum  as  the  present  subject.  Second,  prognostic  stratification 
can  improve  therapeutic  decisionmaking  by  allowing  physicians  to  choose 
treatments  which  have  been  tested  and  have  proven  results  on  patients  with 
the  same  attributes  as  the  present  patient.  This  use  of  prognostic  stratification 
is  directly  applicable  to  patient-specific  decisionmaking  concerning  the 
initiation  or  withdrawal  of  life-sustaining  or  life-prolonging  therapies.  For 
example,  extremely  low  birthweight  infants  with  a  zero  percent  survival 
probability  usually  will  not  have  heroic  therapy  initiated.  Third,  the  design  of 
therapeutic  trials  can  be  refined  by  dividing  the  study  population  into 
distinctive  prognostic  strata  prior  to  comparing  tested  treatments.  This  can  be 
done  to  improve  either  randomized  or  nonrandomized  trials  and  is 
especially  important  in  nonrandomized  trials  when  stratification  is  the  main 
method  available  to  reduce  bias. 

A  third  use  of  prognostic  stratification  is  "to  demarcate  observed 
phenomena  in  a  manner  that  allow  them  to  receive  a  scientific 
explication."19  In  other  words,  prognostic  stratification  is  useful  for 
explaining  the  variability  or  variance  of  observed  outcomes.  With  the 
current  political  and  administrative  emphasis  on  cost  control  and  resource 
allocation  (ICU  utilization,  nursing  care  staffing,  patient  triage  decisions),  this 
ability  of  prognostic  stratification  to  explain  the  variance  of  outcome 
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measures  is  becoming  especially  important  and  useful  in  health  care  research, 
decisionmaking,  and  the  evaluation  of  medical  care  quality. 

Prognostic  stratification  is  able  to  explain  the  variance  of  outcomes  by 
what  has  been  termed  "risk  adjustment"  or  "case-mix  adjustment."  By 
dividing  a  heterogeneous  patient  population  into  groups  (strata)  with  similar 
medical  expectations  (prognoses),  prognostic  stratification  can  reduce  the 
uncertainty  inherent  in  comparing  outcome  measures  and  allow  a  more 
meaningful  comparison  and  explanation  of  observed  outcomes.  Without 
prognostic  stratification,  any  differences  in  target  events  (ie.,  observed 
mortality  rates,  infection  rates,  etc.)  can  not  be  ascribed  to  actual  differences  in 
the  process  being  measured.  "[T]he  compared  maneuvers  may  appear  to  give 
different  results;  but  the  real  difference  may  arise  only  from  disproportions  in 
the  strata  that  constitute  the  cohort."20 

Outcome  comparisons  can  be  made  at  three  organizational  levels: 
between  individuals,  between  groups  of  people,  or  between  institutions. 
Temporal  comparisons  can  also  be  made:  comparing  outcome  data  from  the 
same  individual,  group,  or  institution  during  different  time  periods. 

Although  some  outcome  measures  are  more  commonly  analyzed  in  specific 
situations  (ie.,  mortality  rates  at  an  interinstitutional  level,  surgical  success 
rates  at  an  individual  level),  any  one  outcome  measure  can  be  analyzed  at  any 
organizational  level,  and  the  inherent  uncertainty  of  all  outcome 
comparisons  is  reduced  by  means  of  prognostic  stratification.  In  fact,  outcome 
analysis  is  virtually  dependent  upon  prognostic  stratification  for  the 
meaningful  interpretation  of  results. 

The  most  common  method  of  using  prognostic  stratification  for  the 
explication  of  outcome  variance  involves  the  development  of  prognostic 
scoring  systems.  These  scoring  systems  can  be  and  have  been  applied  to  a 
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variety  of  clinical  conditions  (discussed  below).  Despite  the  varying 
applications  of  these  systems,  they  are  all  generally  predicated  on  the  same 
analytic  formulations  and  are,  consequently,  very  much  alike.  The  analytic 
model  which  is  followed  is  based  on  mathematical  functions  called  additive 
value  functions.  Though  a  detailed  analysis  of  additive  value  functions  is 
beyond  the  scope  of  this  paper,  a  short  description  of  the  foundation  of  these 
functions  is  worthwhile. 

Additive  value  functions,  as  discussed  by  Krischer,21  are  special 
instances  of  mathematical  formulae  designed  to  quantify  the  relative 
importance  that  one  attribute  makes  toward  a  general  condition.  In  the 
health  sciences,  they  are  generally  expressed  in  the  form  of  an  equation: 

V  (Xi,  X2 . Xn)  =  V!  (X] )  +  V2  (X2)  +  ...  +  Vn  (Xn) 

where  V  is  the  prognostic  score,  X],  X2,  ...,  Xn  is  a  list  of  medical  attributes  that 
describe  a  person,  and  Vi,  V2,  ...,  Vn  are  numerical  descriptions  of  the  relative 
importance  or  "weight"  of  each  attribute.  In  terms  of  clinical  prognostic 
stratification,  additive  value  functions  are  designed  to  quantify  the  relative 
importance  that  each  medical  attribute  makes  toward  general  clinical 
prognosis.  The  form  of  this  equation  implies  one  important  point.  Namely, 
that  each  medical  attribute  included  in  the  equation  has  a  contribution  to  the 
formulation  of  medical  prognosis,  and  is  independent  of  the  contributions 
made  by  the  other  medical  attributes. 

One  specific  type  of  prognostic  scoring  system  has  been  termed  a 
severity  of  illness  index.  Like  all  prognostic  scoring  systems,  severity  of 
illness  indices  strive  to  group  patients  based  on  known  attributes,  and  are 
analytically  formulated  on  additive  value  functions.  Severity  of  illness 
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indices  are  becoming  increasingly  popular  in  medical  care  for  two  reasons. 
First,  the  capacity  to  do  the  complex  data  analysis  required  by  sophisticated 
severity  of  illness  index  software  has  increased  dramatically  during  the  last 
decade.  And  second,  as  political  and  economic  attention  has  been 
increasingly  focused  on  the  current  "health  care  crisis"in  the  United  States, 
more  data  has  been  necessary  for  outcome  comparisons.  In  keeping  with  the 
growing  emphasis  on  outcome  comparisons  and  "case-mix"  data  needed  for 
such  comparisons,  severity  of  illness  indices  will  be  the  focus  of  the 
remainder  of  this  paper. 

Because  severity  of  illness  indices  are  a  specific  class  of  prognostic 
scoring  system  and  are  based  on  additive  value  functions,  an  obvious 
question  arises  concerning  the  formulation  of  indices:  "How  are  the  specific 
medical  attributes  (Xi  ...  Xn)  and  their  relative  importance  (V]  ...  Vn) 
determined?"  There  are  basically  two  methods  to  accomplish  this  task.  One 
is  to  base  the  determination  on  clinical  judgement;  the  other  is  to  base  the 
determination  on  statistical  methods  of  analysis.  Oftentimes,  a  severity  of 
illness  index  will  be  created  based  upon  clinical  judgment,  and  later  refined 
based  upon  statistical  analysis. 

As  reviewed  in  a  report  by  the  Institute  of  Medicine,22  three  methods 
of  formulating  group  judgment  exist.  The  three  methods  are  extremely 
similar  in  that  they  all  use  subjective  and  arbitrary  assessments  to  form  the 
basis  of  their  decisions.  They  only  differ  on  the  manner  in  which  the  final 
group  judgment  is  formulated.  One  method,  the  Delphi  technique,  arrives  at 
its  final  formulation  based  a  survey  process  of  isolated,  non-interacting, 
participants.  The  survey  process  elicits  anonymous  opinions  from  members 
of  a  selected  expert  group  by  formal  questionnaire  or  individual  interview. 
Summarized  anonymous  feedback  is  then  provided  to  the  group  members 
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and  opinions  may  again  be  elicited.  The  group  response  is  formulated  by 
aggregation  of  the  individual  opinions  during  the  final  round  of  questioning. 
The  major  advantage  of  the  Delphi  technique  is  that  the  participants  are  not 
in  direct  contact  with  one  another,  and,  thus,  "variables  of  professional  status 
and  personality  have  little  chance  to  influence  opinions  as  they  might  in  face- 
to-face  meetings...."22  Conversely,  the  major  disadvantage  of  the  Delphi 
technique  is  that  the  opportunity  for  the  clarification  of  ideas  among 
participants  is  lacking. 

Nominal  group  technique  (NGT),  differs  from  the  Delphi  technique  in 
that  the  participants  are  not  isolated  from  one  another.  Using  NGT, 
participants  express  their  opinions  anonymously  (either  by  writing  or  by 
computer  terminal),  and  the  opinions  are  considered  by  the  group. 
Clarification  of  opinions  may  be  given  (also  anonymously).  Further  rounds 
of  iteration  and  clarification  of  opinions  may  be  used  as  needed.  The  final 
formulation  of  group  judgment  is  established  by  anonymous  voting  or 
ranking.  The  advantage  of  NGT,  like  the  Dephi  technique,  is  that  effects  of 
personal  status  and  group  hierarchical  structure  are  minimized.  However, 
NGT  is  superior  over  the  Delphi  technique  in  that  it  allows  for  the 
clarification  of  ideas  among  participants. 

The  third  method  of  arriving  at  group  judgment  is  the  consensus 
method.  The  hallmark  of  the  consensus  method  is  its  reliance  on  open 
discussion  among  participants  who  know  one  another.  Consensus 
conferences,  employed  by  the  National  Institutes  of  Health,  are  characterized 
by  an  initial  presentation  of  information,  followed  by  open  discussion,  then 
the  drafting  of  consensus  answers,  further  discussion  and  comments  about 
the  consensus  answers,  and  incorporation  of  the  comments  in  a  final 
consensus  statement.  Obviously,  the  consensus  method  does  not  limit  the 
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influence  of  social,  personal,  or  hierarchical  factors  on  the  formulation  of  the 
final  group  judgment.  However,  it  may  encourage  discussion  to  a  greater 
extent  than  NGT. 

In  contrast  to  the  use  of  group  judgment,  methods  of  statistical  analysis 
use  objectively  derived  information  as  the  basis  for  the  inclusion  of  attributes 
and  their  relative  importance  within  the  severity  of  illness  index.  The 
relevant  information  is  derived  from  multivariable  analysis  to  determine 
"statistically  significant"  associations  between  attributes  and  outcomes.  After 
these  determinations  are  made,  statistically  significant  variables  are  included 
within  the  index,  with  their  relative  importance  correlated  to  the  degree  of 
their  statistical  significance.  The  analysis  is  most  commonly  done  by  one  of 
two  methods:  either  multiple  logistic  regression  or  by  proportional  hazards 
regression  (also  known  as  Cox  regression).  Although  a  detailed  description  of 
these  methods  of  statistical  analysis  are  beyond  the  scope  of  this  paper,  they 
are  covered  in  standard  textbooks  on  statistics  and  have  been  thoroughly 
discussed  in  a  recent  journal  article).23 

Though  statistical  analysis  is  the  more  superficially  credible  method 
(due  to  its  mathematical  rigor)  in  the  development  of  severity  of  illness 
indices,  some  of  the  most  widely  accepted  indices  were  developed  using 
clinical  judgment  as  the  sole  means  of  deriving  the  indices'  formulations. 
One  of  the  most  widely  recognized  and  used  severity  of  illness  indices  is  the 
Apgar  Score.  This  index  was  developed  in  1953  for  the  evaluation  of 
newborn  infants  using  subjective  clinical  judgment  to  create  the 
mathematical  formulation  of  the  index.  Another  clinically  derived  index 
which  has  attained  broad  acceptance  and  use  is  the  Glascow  Coma  Score 
which  was  designed  to  quantify  degree  of  brain  injury  and  improve 
prognostic  accuracy  in  affected  patients. 
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Beginning  in  the  early  1970's  and  continuing  through  the  1980’s,  as 
computer  data  analysis  capacity  increased  and  as  the  field  of  health  care 
administration  and  research  increasingly  focused  attention  on  the  need  for 
more  reliable  and  objective  means  of  analyzing  health  care  outcomes,  greater 
numbers  of  severity  of  illness  indices  were  developed.  Some  indices  were 
created  for  patients  with  specific  diseases  (burns,  liver  disease,  schizophrenia, 
gastrointestinal  hemorrhage,  sepsis,  scleroderma,  drug  overdose,  acquired 
immunodeficiency  syndrome,  etc.).  This  has  been  evident  especially 
concerning  cardiac  disease.24*30  Other  indices  have  been  designed  specifically 
for  certain  patient  groups.  Patients  with  injuries  from  trauma  and,  in 
particular,  automobile-related  trauma  have  been  the  subject  of  many  severity 
of  illness  indices  (Figure  l).31*37 


Figure  1:  Early  Trauma  Indices 

Abbreviated  Injury  Scale  (1971)31 
Trauma  Index  (1971  )32 
Comprehensive  Injury  Scale  (1972)33 
Injury  Severity  Score  (1974)34 
Modified  Trauma  Index  (1974)33 
Triage  Index  ( 1 974)'3^ 

Estimated  Survival  Probability  Index  (1978)37 


As  briefly  mentioned  before,  one  area  of  health  care  in  which  health 
care  researchers  have  fashioned  numerous  severity  of  illness  indices  in  the 
intensive  care  unit  (ICU)  (Figure  2). 33-43  This  is  so  for  numerous  reasons.  For 
one,  intensive  care  units  (ICU’s)  are  increasingly  used  as  the  treatment  setting 
of  choice  for  critically  ill  patients.  Beginning  in  the  1960’s  as  hospital  centers 
for  emergency  cardiac  care,  ICU’s  have  spread  rapidly.  By  the  mid-1970's,  as 
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prognostic  scales  were  proliferating,  it  was  reported  that  seven  out  of  every 
one  thousand  patients  were  transferred  to  an  ICU  for  specialized  care,  and  it 
was  projected  that  nationally  over  seven  million  patient  days  were  spent  in 
ICU's  annually.39 


Figure  2:  Intensive  Care  Treatment  Indices 

Therapeutic  Intervention  Scoring  System  (1974)3<s 
Condition  Index  Score  (1981  )39 

Acute  Physiology  and  Chronic  Health  Evaluation  (APACHE)  (1981  )49 
Simplified  Acute  Physiology  Score  (1984)41 
Mortality  Prediction  Model  (1985)43 
APACHE  11  (1985)43 

Organ  System  Failure  Based  Model  (1991)44 
APACHE  III  (1991  )45 


Second,  ICU’s  are  the  most  costly  site  of  medical  care  because  they  are 
the  main  locus  in  which  acute,  technologically  advanced  and  expensive  care 
is  given.  Heroic,  life-sustaining  treatment  is  readily  available  and  is  often 
used  in  ICU’s  to  keep  patients  alive  while  in  the  midst  of  their  acute  illness. 
This  fact,  coupled  with  the  large  volume  of  patients  who  are  treated  in  ICU’s, 
accounts  for  the  exponential  growth  of  ICU  expenditure.  In  1992,  intensive 
care  services  cost  approximately  62  billion  dollars  and  accounted  for  nearly  8% 
of  projected  national  health  care  spending46  and  approximately  1%  of  the 
United  States'  GNP.47  Because  of  the  vast  quantities  of  money  being  spent  in 
ICU's,  it  became  imperative  that  cost-efficient  diagnosis  and  treatment  plans 
be  developed  through  risk  adjusted  outcome  assessment. 

A  third  reason  that  intensive  care  was  well  suited  for  prognostic 
stratification  research  was  that  ICU  outcomes  were  known  to  vary  widely 
between  different  ICU's,  but  little  was  known  as  to  the  cause  of  the  variation. 
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One  review  article  reported  mortality  rates  in  surgical  ICU's  to  range  from  2% 
to  44%.4^  With  the  tools  available  in  the  late  1960’s  and  early  1970's,  it  was 
impossible  to  ascertain  whether  this  outcome  variation  was  due  to  differences 
in  therapeutic  efficiency  (the  quality  of  the  medical  care)  or  the  differences  in 
patient  case  mix  (the  degree  of  initial  severity  of  illness  and  comorbidity). 

Fourth,  intensive  care  was  (and  still  is)  characterized  by  a  great  degree 
of  patient  monitoring,  laboratory  use,  and  the  routine  daily  collection  and 
recording  of  a  large  amount  of  detailed  clinical  information.  Consequently, 
researchers  interested  in  medical  outcome  analysis  had  a  large  amount  of 
readily  available  data  with  which  to  develop  more  sophisticated  predictive 
models  to  measure  patient  case  mix. 

Consequently,  due  to  the  combination  of  these  four  characteristics  of 
intensive  care  medicine  (a  large  volume  of  patients,  an  exponentially 
increasing  use  of  financial  resources,  great  variance  in  medical  outcomes,  and 
readily  available  clinical  and  laboratory  data),  medical  outcome  research  was 
focused  on  intensive  care  from  an  early  date.  In  this  setting,  many  research 
tools  were  developed  to  allow  a  more  meaningful  evaluation  or  analysis  or 
intensive  care  data.  Many  prognostic  scoring  systems  and  severity  of  illness 
indices  were  developed  with  the  purpose  of  explaining  some  of  the  variance 
known  to  exist  in  intensive  care  outcomes  and  allowing  risk-adjusted 
comparisons  of  medical  outcomes. 


APACHE 


One  index  which  was  designed  to  help  explain  the  variance  of 
outcomes  in  ICU’s  is  the  Acute  Physiology  and  Chronic  Health  Evaluation 
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(APACHE)  prognostic  scoring  system.40  This  prognostic  system  was 
developed  by  William  A.  Knaus,  M.D.  and  colleagues  at  the  George 
Washington  University  Medical  Center.  In  1978,  with  a  grant  by  the  U.S. 
Health  Care  Financing  Administration  (HCFA),  Knaus  et.  al.  began 
developing  APACHE  to  provide  "an  improved  method  for  estimating  the 
pretreatment  risk  of  death  of  acutely  ill  hospitalized  patients...."44  In  other 
words,  one  of  the  purposes  of  the  APACHE  system  was  to  provide  a  means  to 
improve  clinical  prognostic  accuracy. 

In  addition  to  its  predictive  function,  APACHE  was  designed  with  an 
explicative  purpose.  APACHE  was  designed  to  explain  outcome  variance 
(differences  in  in-hospital  mortality  rates)  by  "adjusting"  for  the  patients' 
initial  composite  severity  of  illness.  Thus,  APACHE  could  provide  an 
objective  means  to  assess  patient  care  outcomes  at  different  institutions  or 
within  the  same  institution  at  different  points  in  time  or  among  different 
patient  groups.  Knaus  and  his  colleagues  felt  that  APACHE  could  have 
analytic  use  by  improving  "the  ability  of  nonrandomized  study  designs  to 
produce  convincing  evidence  of  efficacy."40  The  essence  of  the  APACHE 
severity  of  illness  index  was  that  it  could  objectively  quantify  the  degree  of 
patient  illness,  and  thus,  could  provide  "an  answer  to  the  nagging  question  of 
most  nonrandomized  studies:  'Were  the  two  patient  groups  similar?'"40 

The  conceptual  approach  of  APACHE  toward  measuring  a  patient's 
severity  of  illness  included  two  areas  of  assessment:  a  physiological  score 
representing  acute  illness  and  a  preadmission  health  evaluation  indicating 
health  status  before  acute  illness.  The  acute  physiology  score  (APS)  was 
calculated  by  measuring  the  most  abnormal  value  of  each  of  thirty-four 
physiologic  variables  (blood  pressure,  heart  rate,  serum  pH,  serum  sodium, 
etc.)  during  the  first  thirty-two  hours  of  ICU  admission.  Each  variable  and  its 


19 


relative  influence  on  the  APS  were  determined  by  consensus  methods.  The 
variables  were  chosen  to  represent  a  derangement  in  one  of  the  body's  seven 
major  physiological  systems  (neurological,  cardiovascular,  respiratory, 
gastrointestinal,  renal,  metabolic,  and  hematological)  and  were  assigned  a 
weight  ranging  from  zero  to  four  depending  on  the  degree  of  derangement: 
the  more  abnormal  the  measured  variable,  the  greater  the  weight  assigned. 
Because  the  APS  was  solely  the  sum  of  the  measured  variables,  it  followed 
that  the  more  abnormal  the  measured  variables,  the  higher  would  be  the 
APS,  thus  representing  a  more  acutely  ill  the  patient. 

The  second  area  of  assessment,  the  preadmission  health  status,  was 
included  within  APACHE  so  as  to  differentiate  patients  with  similar  acute 
physiology,  yet  with  different  abilities  to  "fight  off  death"  and  recover.  In  the 
original  APACHE  this  assessment  was  based  on  the  patient's  functional 
status,  productivity,  and  degree  of  medical  attention  sought  six  months  prior 
to  admission.  The  patient  was  assigned  a  letter  designation  (A,  B,  C,  or  D)  to 
correspond  to  a  physiologic  reserve  ranging  from  good  prior  health  (A)  to 
severe  restriction  of  activity  due  to  disease  (D). 

Although  APACHE  was  a  fairly  reliable  means  of  stratifying  ICU 
patients  according  to  their  severity  of  illness  and  risk  of  subsequent  hospital 
death,  APACHE  proved  to  be  too  complex  for  routine  clinical  use.49 
Consequently,  Knaus  and  colleagues  reevaluated  APACHE,  made  some 
necessary  changes,  and  called  the  modified  system  APACHE  II  (Figure  3). 
Whereas  the  original  APACHE  score  was  based  on  two  components  (the  APS 
and  the  preadmission  health  status),  the  APACHE  II  score  was  the  numerical 
sum  of  three  components: 


APACHE  II  Score  =  APS  +  Age  points  +  Chronic  Health  points. 
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In  the  area  of  assessing  the  degree  of  acute  illness,  the  APACHE  acute 
physiology  score  was  judged  to  incorporate  too  many  physiologic  variables, 
some  of  which  were  not  routinely  measured  (serum  osmolarity,  serum  lactic 
acid,  skin  anergy  testing,  etc.)  or  added  little  explanatory  power  (ie.,  serum 
glucose,  urinary  output,  albumin,).43  By  using  a  combination  of  consensus 
methods  (based  on  clinical  judgment)  and  methods  of  statistical  analysis,  the 
thirty-four  APACHE  variables  were  reduced  to  twelve  to  form  the  APACHE  II 
APS.  Another  change  to  the  APS  consisted  of  revising  the  weighting  system. 
Although  most  variables  were  weighted  in  the  same  manner  as  before  (from 
zero  to  four,  with  a  weight  of  four  signifying  the  greatest  physiologic 
derangement)  three  variables  were  given  different  calculation  parameters, 
namely,  serum  creatinine  during  acute  renal  failure,  the  Glascow  coma  score 
as  a  measure  of  neurologic  function,  and  oxygenation  assessment  was 
quantified  with  consideration  of  inspired  oxygen  concentration  (FiCb).  As 
well,  in  a  slight  change  from  the  original  APACHE  system,  the  recorded  value 
of  each  physiologic  variable  was  based  on  the  most  abnormal  reading  during 
the  patient’s  first  twenty-four  (instead  of  thirty-two)  hours  in  the  ICU. 

The  assessment  of  prior  health  status,  though  conceptually  the  same  as 
in  the  original  APACHE,  was  more  specifically  defined.  The  new  system's 
prior  health  assessment  considered  patient  age  (a  graded  scale  ranging  from 
zero  to  six,  with  advanced  age  being  accorded  more  points),  and  chronic 
health  evaluation.  The  chronic  health  evaluation  was  based  on  both  chronic 
organ  system  insufficiency  (either  hepatic,  cardiovascular,  respiratory,  renal, 
or  immunologic)  and  operative  status  (nonoperative,  emergency 
postoperative,  or  elective  postoperative). 
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The  major  modification  incorporated  into  the  APACHE  system  was  a 
classification  of  disease  process.  This  was  done  with  the  realization  that 
patients  with  equal  acute  physiologic  derangement  but  with  different 
pathophysiologic  processes  are  clinically  expected  to  have  unequal  mortality 
risks.  As  an  example:  two  patients,  both  with  identical  pathophysiologic 
abnormalities  (poor  oxygenation  and  acidemia)  are  admitted  to  the  ICU  for 
respiratory  support.  One  patient  is  admitted  due  to  asthma;  the  other  due  to 
an  advanced  lung  neoplasm.  Although  their  acute  physiologic  derangements 
are  identical,  these  patients  have  significantly  different  probabilities  of  dying. 
Because  the  latter  patient  has  a  much  more  life  threatening  disease,  his  risk  of 
death  is  much  higher.  In  order  to  incorporate  this  reasoning,  the  modified 
APACHE  assigned  each  patient  to  a  diagnostic  category  and  assigned  each 
diagnostic  category  its  own  weighting  used  to  predict  mortality. 

The  APACHE  II  scoring  system  retained  the  same  conceptual 
framework  of  the  original  APACHE  system  in  that  each  patient's  total 
APACHE  score  was  computed  by  summing  the  APS,  the  age  points,  and  the 
chronic  health  points.  The  range  of  possible  APACHE  II  scores  is  from  zero  to 
seventy-one  with  a  higher  score  associated  with  a  greater  likelihood  of  death. 
At  the  time  of  original  publication  describing  the  APACHE  II  system,  the 
highest  actual  patient  score  recorded  was  fifty-five.  Although  the  APACHE  II 
score  could  be  used  for  patient  classification  and  stratification  within 
treatment  groups,  it  alone  could  not  be  used  to  predict  medical  outcomes 
(death  versus  survival).  The  APACHE  II  score  was  only  one  factor  used  in 
the  calculation  of  predicted  mortality  risk. 

APACHE  II  was  able  to  prognosticate  in-hospital  mortality  rates  by 
incorporating  disease  process  classification  into  its  patient  assessment. 
Specifically,  APACHE  II  placed  patients  into  one  of  fifty-three  diagnostic 
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categories  (twenty-nine  being  nonoperative,  and  twenty-four  being 
postoperative)  according  to  the  reason  for  ICU  admission.  Each  category  was 
assigned  a  diagnostic  category  weight  (DCW),  and  predicted  risk  of  subsequent 
hospital  death  could  be  calculated  by  using  the  following  multiple  logistic 
regression  equation  : 

In  (R/l-R)=  -3.517+  .146  (APACHE  II  score)+  DCW  +  .603  {only  if 
emergency  postoperative}, 

where  R  is  the  risk  of  death. 

It  should  be  noted  that  the  APACHE  II  system  of  risk  prediction, 
though  based  on  an  equation  used  to  predict  mortality  risk  for  individual 
patients,  was  not  designed  to  be  used  for  individual  patient  risk  prediction. 
The  APACHE  II  was  developed  to  predict  mortality  risk  for  groups  of  patients. 
(Individual  patient  prognosis  was  felt  to  be  an  unrealistic  goal  for  two 
reasons:  one,  because  of  the  great  variation  among  individual  patients  in 
terms  of  hospital  course  and  pre-existing  disease,  and  two,  because 
"physicians  are  ...  reluctant  to  accept  estimated  probabilities  as  the  basis  of 
limiting  or  stopping  treatment."40  (In  order  to  maintain  the  accuracy  of 
APACHE  II,  it  was  suggested  that  disease-specific  mortality  predictions  must 
be  derived  on  a  minimum  of  fifty  patients  in  each  diagnostic  category,  with  at 
least  twenty  patients  in  the  least  frequent  outcome  category.)  Though  not 
designed  to  substitute  for  clinical  judgment  regarding  individual  patient 
outcome  prediction  or  treatment  decisions,  APACHE  II  could  be  used  as  a 
source  of  information  for  clinicians  and  as  a  tool  for  analysis  of  medical 
outcomes.  As  well,  it  should  be  realized  that  APACHE  II  was  designed  as  a 
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means  of  prospective  health  assessment  and  risk  stratification.  Thus, 
APACHE  II  measurements  (physiologic  variable  readings)  and  classification 
decisions  (placement  of  patients  into  disease  categories)  were  made  on  a  real 
time  basis  without  the  advantage  of  hindsight. 

The  APACHE  II  prognostic  scoring  system  was  validated  in  a  study  of 
5,815  ICU  admissions  at  thirteen  different  hospitals.43  A  direct  relationship 
between  APACHE  II  scores  and  observed  hospital  mortality  rate  was  found  to 
be  statistically  significant  through  the  entire  range  of  scores  (greater  APACHE 
II  scores  were  associated  with  higher  mortality  rates).  Direct  cross-tabulations 
and  multivariate  techniques  were  used  to  compare  the  predictive  power  of 
APACHE  II  with  the  original  APACHE  system  and  examine  the  relative 
importance  of  the  components  of  APACHE  II.  This  analysis  showed  that 
APACHE  II  had  better  explanatory  power  than  the  original  APACHE  as 
measured  by  model  chi-square,  percent  of  cases  correctly  identified,  R-squared, 
rank  correlation  between  outcome  and  predicted  probability,  and  area  under 
ROC  curve  (Table  1).  The  overall  correct  classification  rate  (classifying 
patients  as  either  predicted  to  die  or  predicted  to  survive)  was  eighty-six 
percent  (using  a  predicted  mortality  cutoff  point  of  0.50)  (Table  2). 

In  general,  APACHE  II  has  been  quite  a  success  as  is  evident  by  its 
widespread  use  by  other  researchers.  The  APACHE  II  system  has  been 
incorporated  into  independent  studies,  in  research  related  to  both  clinical 
decision-making  and  health  policy. 7>49  As  well,  in  the  clinical  realm  of 
medicine,  hospitals  have  used  APACHE  II  within  their  ICU’s  as  the  basis  for 
utilization  and  outcome-based  quality  management  programs,  and  some 
foreign  countries  have  even  adopted  APACHE  II  as  the  prognostic  scoring 
system  in  all  of  their  ICU’s.49 
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Table  1:  Measures  of  Aggregate  Explanatory  Power 


APACHE* 

APACHE  11 

Model  Chi-square 

1537.1 

1634.5 

R-squared 

.310 

.319 

Rank  correlation  (between 
outcome  and  predicted  probability) 

.730 

.739 

Area  under  ROC  curve 

.851 

.863 

Data  presented  in  "APACHE  II:  A  Severity  of  Disease  Classification  System"  by 
Knaus  et.  al. 


Table  2:  Outcome  Classification  (APACHE  vs.  APACHE  II) 


APACHE* 

(N=582) 

APACHE  II 
(N=5030) 

Sensitivity  (%) 

49.0 

47.0 

Specificity  (%) 

97.3 

94.9 

PPV  (%) 

79.4 

69.6 

NPV  (%) 

90.0 

87.9 

Correct  (%) 

88.8 

85.5 

A  cutoff  of  0.50  was  used  to  divide  the  two  predicted  groups.  Calculated  mortality  risk 
>  0.50  implied  predicted  death;  calculated  mortality  risk  <  0.50  implied  predicted 
survival. 

Data  presented  in  "APACHE  II:  A  Severity  of  Disease  Classification  System"  by 
Knaus  et.  al. 
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However,  despite  widespread  utilization  of  APACHE  II  by  independent 
research  groups,  there  still  exists  some  controversy  about  its  role  in  case-mix 
risk  adjustment,  severity  of  illness  stratification,  and  mortality  prediction. 
There  have  been  both  prospective  and  retrospective  studies  of  APACHE  II's 
utility  when  applied  to  specific  patient  groups  or  as  compared  to  other 
prognostic  scoring  systems.  APACHE  II  has  been  studied  in  regard  to  patients 
receiving  total  parenteral  nutrition,50  patients  with  cardiogenic  pulmonary 
edema,51  with  hematologic  malignancies,52  with  perforated  ulcers,55  with 
acute  pancreatitis,54  with  breast  cancer,55  after  cardiac  arrest,56  with  acute 
trauma,57  with  intra-abdominal  abcesses,58  and  (as  will  be  discussed  below)  in 
patients  with  acquired  immunodeficiency  syndrome  (AIDS).59'62  Several 
studies  highlight  APACHE  II's  lack  of  advantage  over  clinical  judgment  in 
terms  of  predictive  accuracy,63'65  while  another  study  brings  attention  to  the 
lead-time  bias  (caused  by  measuring  patients  attributes  at  varying  times, 
including  after  initiation  of  therapy)  inherent  in  the  APACHE  II  system.66 
One  research  group  documented  APACHE  II's  inaccuracy  in  predicting 
surgical  ICU  parameters  such  as  length  of  stay  and  total  hospital  costs,67 
though  this  should  not  be  suprising  since  APACHE  II  was  designed  to  predict 
and  explain  mortality  rates,  not  utilization  measures. 

Although  these  numerous  publications  have  pointed  out  APACHE  II's 
failings,  they  are,  at  the  same  time,  a  reference  to  APACHE  II's  strengths  as  a 
prognostic  scoring  system  and  act  as  testimony  to  the  importance  APACHE  II 
has  achieved  in  the  field  of  outcome  prediction,  risk  stratification,  and  quality 
management. 


I 

i 
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Prognostic  stratification  in  patients  with  AIDS  or  HIV  seropositivity 

Many  publications  concerning  patients  with  AIDS  have  focused  on  the 
relationship  of  patient  attributes  (CD4+  T-cell  counts,  method  of  HIV 
infection),  diseases,  and  therapies.  Much  of  the  outcome  research  on  patients 
with  AIDS  has  been  directed  primarily  on  the  documentation  of  mortality 
rates  and  the  determination  of  prognostic  factors.  Numerous  papers  have 
documented  the  outcomes  of  patients  hospitalized  with  AIDS,  most  especially 
patients  with  Pneumocystis  carinii  pneumonia  (PCP).  In  the  early  years  of  the 
AIDS  epidemic  (before  1988),  studies  consistently  reported  poor  survival  rates, 
particularly  in  patients  with  respiratory  failure  necessitating  mechanical 
ventilation.  Most  of  these  patients  had  PCP,  and  reported  mortality  rates 
ranged  from  86%  to  100%/ 5,6,9,68,69  \n  1988,  mortality  rates  were  noted  to 
drop.  One  study  reported  mortality  among  patients  with  first  episodes  of  PCP 
to  be  58%. 8  Since  this  initial  report  of  improved  outcome,  three  studies  have 
reported  in-hospital  mortality  rates  (in  AIDS  patients  with  pneumonia 
requiring  mechanical  ventilation)  that  confirm  this  lower  mortality  rate.4'7'16 

Following  this  trend  in  improved  in-hospital  survival  in  AIDS 
patients  with  pneumonia  (either  due  to  P.  carinii  or  another  pathogen),  out¬ 
patient  survival  rates  were  reported  to  be  improving  as  well.  In  1990,  two 
reports  documented  improved  short-term  (1-year)  out-patient  survival  rates 
from  1984  through  1987. Hdi  By  1991,  in  a  study  of  AIDS  patients  who  were 
admitted  with  acute  respiratory  failure  due  to  PCP  between  January  1,  1987 
and  December  31,  1990,  in-hospital  mortality  was  53%  and  1-year  mortality 
was  63%. 13 

In  addition  to  longitudinally  tracking  mortality  rates,  researchers  have 
focused  their  attention  on  the  elucidation  of  prognostic  factors  for  patients 
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with  AIDS.  Numerous  clinical  and  laboratory  variables  have  been 
documented  to  have  prognostic  value  in  stratifying  AIDS  patients.  In  1987, 
Brenner  et.  al.12  linked  poor  outcomes  in  patients  with  PCP  to  five 
determinants:  (1)  severe  abnormalities  on  initial  chest  radiographs,  (2) 
alveolar-arterial  oxygen  gradient  (AaPO?)  greater  than  30  mm.  Hg,  (3)  severity 
of  interstitial  edema  on  initial  transbronchial  biopsy,  (4)  the  persistence  of 
Pneumocystis  cysts  on  follow-up  biopsy,  and  (5)  diagnosis  of  PCP  before  July, 
1985.  Brenner  et.  al.  also  reported  that  total  leukocyte  counts  has  no 
prognostic  significance  for  either  acute  or  long-term  life  expectancy. 

In  1988,  El-Sadr  and  Simberkoff8  reported  that  survival  of  patients  with 
PCP  was  predicted  by  a  constellation  of  clinical  and  laboratory  findings:  a  short 
duration  of  symptoms  (most  commonly  shortness  of  breath,  cough,  fever, 
chest  pain,  and  weight  loss),  better  arterial  oxygenation  on  admission,  and  a 
decrease  in  serum  lactate  dehydrogenase  (LDH)  and  an  improvement  in 
arterial  blood  gas  determinations  after  institution  of  mechanical  ventilation. 
In  1991,  Peruzzi  et.  al.70  reported  that,  in  patients  with  PCP,  the  presence  of 
metabolic  acidemia  (a  pH  <  7.35  and  a  base  deficit  >  4  mEq/L)  and  prolonged 
positive  end-expiratory  pressure  (PEEP)  >  10  cm.  H2O  were  poor  prognostic 
signs.  Peruzzi  et.  al.,  in  disagreement  with  Brenner  et.  al.,  reported  that  chest 
radiographic  findings  did  not  predict  mortality  in  this  group  of  patients. 

Serum  LDH  was  determined  to  be  of  prognostic  value  in  patients  with 
PCP  by  several  groups  of  researchers.  In  1988,  Lipman  and  Goldstein71 
presented  evidence  that  serum  LDH  in  PCP  patients  had  high  prognostic 
value.  Serum  LDH  values  greater  than  520  IU  had  a  100%  sensitivity  of 
predicting  death  and  a  specificity  of  61%.  These  results  were  confirmed  by  two 
groups  of  researchers.  Kagawa  et.  al.,  though  primarily  studying  the  use  of 
serum  LDH  in  the  diagnosis  of  PCP,  concluded  that  serum  LDH  "correlates 
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with  the  severity  of  [PCP  patients']  oxygen  transfer  disorder."72  Garay  and 
Greene  determined  that  serum  LDH  levels,  AaPCb  gradients,  and  mean 
respiratory  rates  had  prognostic  value  for  patients  with  PCP.77 

Of  note,  Garay  and  Greene  found  no  significant  difference  between  PCP 
survivors  and  nonsurvivors  "with  respect  to  duration  of  symptoms,  presence 
of  physical  findings  (other  than  respiratory  rate),  and  admission  leukocyte  or 
lymphocyte  counts."73  This  is  in  agreement  with  Brenner's  findings 
concerning  leukocyte  counts  and  in  disagreement  with  El-Sadr  and 
Simberkoff’s  results  concerning  symptom  duration. 

Besides  the  determination  of  individual  prognostic  factors  for  patients 
with  AIDS  (predominantly  with  PCP),  two  separate  prognostic  scoring 
systems  have  been  developed  for  patients  with  AIDS.  The  AIDS  Prognostic 
Staging  System,74  created  by  Justice  and  Feins tein  in  1989,  was  designed  to 
stratify  hospitalized  and  non-hospitalized  AIDS  patients  according  to 
predicted  long-term  mortality.  The  system,  developed  and  tested  on  a  cohort 
of  117  AIDS  patients  between  October,  1981  and  July,  1987,  divides  patients 
into  three  stages  based  on  four  potential  forms  of  physiologic  deficit: 
nutritional,  respiratory,  neurologic,  or  hematologic.  One  point  is  given  for 
each  of  the  following:  severe  diarrhea  or  serum  albumin  <  2.0  g/dl,  any 
neurologic  deficit,  arterial  oxygen  tension  <  50  mm  Hg,  hematocrit  <  30 
percent,  lymphocyte  count  <  150/gl,  white  blood  cell  count  <  2500,  and 
platelet  count  <  140,000  (Table  3).  The  total  score  determines  the  assignment 
of  stage:  Stage  I  (0  points).  Stage  II  (1  point),  and  Stage  III  (2  to  7  points).  In  so 
far  as  the  initial  cohort  of  117  patients  with  AIDS,  the  AIDS  Prognosis  score 
proved  to  be  a  reliable  means  of  stratifying  patients  according  to  out-patient 
mortality:  patients  in  Stages  I,  II,  and  III  had  median  survival  times  of  11.6, 
5.1,  and  2.1  months,  respectively. 
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Table  3:  The  AIDS  Prognosis  Score 

Characteristic  Points  Awarded 


Nutritional 

—  Severe  diarrhea  and/or 

serum  albumin  <  2.0  g/dl  1 

Respiratory 

--  P02  <  50  mm  Hg  1 

Neurologic 

—  Any  manifestation  1 

Hematologic 

—  Lymphocytes  <  150  1 

—  Hematocrit  <  30%  1 

—  White  blood  cells  <  2,500  1 

—  Platelets  <  140,000  1 


The  second  AIDS-specific  prognostic  scoring  system  developed  was  the 
PCP  Severity  Score  (PSS).75  Created  by  Speich  et.  al.  in  1990,  the  PSS  was 
developed  on  a  cohort  of  59  nonintubated  PCP  hospitalized  patients.  It  was 
designed  to  stratify  this  group  of  patients  according  to  mortality  within  14 
days  aftPr  bronchoscopy.  The  score  is  constructed  by  the  degree  of  abnormality 
of  three  measures:  alveolar-arterial  oxygen  gradient  (AaPO?),  serum  LDH, 
and  percentage  of  brochoalveolar  lavage  (BAL)  neutrophils  (Table  4).  These 
variables  were  chosen  by  statistical  analysis  and  is  in  agreement  with  many  of 
the  previously  discussed  reports.  Variables  which  were  found  to  have  no 
statistical  significance  included  duration  of  symptoms,  hemoglobin,  total 
leukocyte  and  lymphocyte  counts,  and  C-reactive  protein.  It  should  be  noted 
that  Speich  et.  al.'s  finding  that  total  leukocyte  and  lymphocyte  counts  are 
not  prognostically  significant  are  in  agreement  with  the  results  reported  by 
Garay  and  Greene  but  are  in  disagreement  with  those  reported  by  Justice  and 
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Feinstein.  However,  the  disagreement  between  these  results  could  be  a 
function  of  the  different  outcomes  being  measured  (14  day  mortality  versus 
long  term  mortality).  The  PSS  was  validated  in  1992  in  a  prospective 
evaluation  of  94  patients  with  PCP.  In  this  follow-up  validation  study,  the 
PSS  displayed  an  overall  diagnostic  accuracy  of  95.7%. 


Table  4:  The  PCP  Severity  Score 


Points 

AaPCb  (mm  Hg) 

LDH  (U/l) 

BAL  neutrophils  (%) 

1 

>  20 

>  460 

>2 

2 

>  30 

>  920 

>5 

3 

>  40 

>  1,380 

>  10 

4 

>  50 

>  15 

5 

>  60 

>  20 

As  reviewed  above,  although  AIDS-specific  prognostic  scoring  systems 
have  been  developed  and  numerous  prognostic  factors  have  been 
determined,  there  is  disagreement  concerning  the  validity  of  individual 
prognostic  factors.  Therefore,  the  utility  of  these  systems  and  factors  is 
limited. 

In  contrast  to  the  AIDS-specific  scoring  systems  mentioned  above,  the 
APACHE  II  prognostic  scoring  system  has  been  used  extensively  and 
independently  validated.  However,  because  APACHE  II  was  devised  before 
the  rapid  spread  of  HIV  infection,  the  accuracy  of  APACHE  II  in  respect  to 
patients  with  AIDS  is  open  to  question.  By  September  1990  (the  date  of 
inception  of  the  current  study),  only  one  research  group  had  published  results 
of  applying  APACHE  II  to  patients  with  AIDS. 

Smith  et.  al.  published  the  results  of  their  research  in  October  1989. 59 
Analyzing  data  collected  retrospectively  from  the  medical  records  of  83  MICU 
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patients  with  AIDS  admitted  between  January  1984  and  September  1988,  they 
found  that  APACHE  II  significantly  underestimated  in-hospital  mortality 
(45.8%  (E)  vs.  63.9%  (O))  (Table  5).  The  inaccuracy  of  the  APACHE  II  mortality 
prediction  was  shown  to  be  due  solely  to  an  underprediction  of  mortality  in 
patients  with  Pneumocystis  carinii  pneumonia  (PCP)  requiring  mechanical 
ventilation.  In  patients  with  PCP  (n=37),  predicted  mortality  was  significantly 
less  than  observed  mortality  (44.3%  vs.  86.5%),  whereas  in  all  other  AIDS 
patients  without  PCP  (n=46)  predicted  and  observed  death  rates  were  similar 
(47.0%  vs.  45.7%).  The  similarity  of  predicted  and  observed  mortality  rates 
extended  to  AIDS  patients  without  PCP  who  required  mechanical  ventilation 
(n=17)  (72.3%  vs.  76.5%)  and  AIDS  patients  without  PCP  who  did  not  require 
mechanical  ventilation  (n=22)  (31.6%  vs.  36.4%). 


Table  5:  Comparison  of  APACHE  II  Predicted  and  Observed  Mortality  (Smith  et.  al.) 

Predicted  Mortality 

Observed  Mortality 

Group 

Rate,  % 

Rate,  % 

All  Non-AIDS  (n=166) 

34.1 

31.3 

Mech  vent  (n=25) 

48.1 

52.0 

Total  AIDS  (n=83) 

45.8 

63.9 

Mech  vent  &  PCP  (n=37) 

44.3 

86.5 

Not  mech  vent  &  PCP  (n=46) 

47.0 

45.7 

Mech  vent,  non-PCP  (n=17) 

72.3 

76.5 

No  mech  vent,  non-PCP  (n=22) 

31.6 

36.4 

Also,  it  was  reported  that  mean  APACHE  II  scores  did  not  differentiate 
survivors  from  nonsurvivors  (22.8  vs.  26.0)  nor  differentiate  survivors  from 
nonsurvivors  in  the  subset  of  mechanically  ventilated  patients  with  PCP  (23.4 
vs.  22.3)  (Table  6).  However,  when  the  subset  of  mechanically  ventilated 


patients  with  PCP  was  excluded,  APACHE  II  successfully  differentiated 
survivors  from  nonsurvivors  (22.6  vs.  31.6). 
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Table  6:  Comparison  of  APACHE  II  Scores  Between  Survivors  and  Nonsurvivors  (Smith  et.  al.) 


Group 

Survivors 

Nonsurvivors 

p  Value 

All  non-AIDS 

15.5  ±7.0 

27.2  ±  8.6 

<  0.01 

All  AIDS 

22.8  ±  5.7 

26.0  ±  9.1 

NS 

Mech  vent  &  PCP 

23.4  ±  4.0 

22.314.7 

>  0.05 

All  other  AIDS 

22.6  ±  6.1 

31.6  ±  11.3 

<  0.01 

Smith  et.  al.  explained  that  the  discrepancy  between  predicted  and 
observed  mortality  rates  in  patients  with  PCP  (and,  by  extension,  in  the  entire 
AIDS  population)  is  most  likely  due  to  the  lack  of  a  specific  APACHE  II 
diagnostic  category  coefficient  that  accurately  reflects  the  severity  of  PCP.  In 
other  words,  the  failure  of  APACHE  II  to  successfully  stratify  patients  with 
AIDS  was  caused  by  APACHE  II's  underestimation  of  the  severity  of  PCP. 

Because  the  patient  population  studied  by  Smith  et.  al.  was  composed 
exclusively  of  Veterans  (presumably  male  and  predominantly  not 
homosexual),  there  was  some  uncertainty  as  to  the  validity  of  the  results  in 
respect  to  a  more  typical  urban  population.  As  well,  because  Smith  et.  al. 
studied  a  population  admitted  to  the  MICU  between  January  1984  and 
September  1988  (prior  to  improved  PCP  survival  rates),  it  was  felt  that 
another  study  of  the  application  of  APACHE  II  to  HIV-positive  patients 
(including  patients  admitted  after  1988  with  the  prospect  of  improved 
survival  from  PCP)  was  warranted. 
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II.  METHODS 


Record  Selection 


The  current  study  involved  a  review  of  the  medical  records  of  all  HIV- 
positive  patients  admitted  to  the  Yale-New  Haven  Hospital  medical  intensive 
care  unit  (MICU)  during  a  five  year  period.  Records  were  selected  by  hospital 
discharge  date,  between  October,  1986  and  September,  1991.  The  patients  were 
identified  by  a  computerized  search  of  patient  records  by  the  ICD-9 
(International  Center  for  Disease,  ninth  edition)  coding  of  their  diagnoses. 
Patients  were  included  by  being  identified  as  having  either  AIDS,  AIDS- 
related  complex  (ARC),  HIV  seropositivity,  or  any  of  the  AIDS-associated 
opportunistic  diseases  ( Pneumocystis  carinii  pneumonia,  Kaposi's  sarcoma, 
progressive  multifocal  leukoencephalopathy,  or  central  nervous  system 
lymphoma,  etc.).  A  list  of  181  admissions  was  compiled.  Eleven  admission 
records  were  either  incomplete,  could  not  be  located  by  the  medical  records 
department,  or  were  otherwise  unavailable  for  review.  Of  the  remaining  170 
records,  eight  were  excluded  because  the  patients  had  AIDS-associated  diseases 
though  were  not  HIV  positive  and  did  not  have  AIDS  or  ARC.  (The  patients 
were  immunosuppressed  for  another  reason,  ie.  post  transplant,  malignancy, 
chronic  high-dose  steroid  use).  One  record  was  excluded  because  the  patient 
was  infected  with  HIV  type  II,  not  the  usual  HIV  type  I.  One  record  was 
excluded  because  the  patient  was  found  never  to  have  been  admitted  to  an 
ICU,  and  two  records  were  excluded  because  the  patients  had  been  admitted  to 
the  surgical  ICU,  not  the  medical  ICU.  Among  the  158  records  remaining,  it 
was  discovered  that  some  patients  had  multiple  admissions  to  the  MICU 
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which  were  not  identified  by  the  initial  computer  search.  These  MICU 
admissions  were  counted  as  separate  if  they  occurred  during  different  hospital 
admissions,  occurred  during  the  same  hospital  admission  but  at  least  four 
days  apart,  or  if  the  patients  were  admitted  to  the  MICU  for  different  reasons 
(if  the  patients  were  placed  in  different  diagnostic  categories  according  to 
APACHE  II  admission  criteria).  Three  records  fell  into  this  last  category  of 
being  absent  from  the  computer-generated  list.  Thus,  in  total,  161  admission 
records  were  utilized. 


Data  Collection 


Medical  records  were  reviewed  retrospectively  between  December,  1991 
and  June,  1992  with  the  approval  of  the  Human  Investigation  Committee. 
APACHE  II  data  were  obtained  from  multiple  information  sources  in  the 
patients'  charts  including  MICU  admission  notes,  progress  notes,  transfer 
notes,  MICU  nursing  admission  sheets,  emergency  room  admission  sheets, 
emergency  room  transfer  sheets,  paramedic  or  ambulance  transfer  sheets,  and 
laboratory  reports.  In  addition  to  information  required  for  APACHE  II 
scoring,  information  of  potential  or  reported  prognostic  value  was  recorded. 
This  included  HIV  risk  factor,  HIV  diagnosis  date,  leukocyte  counts  and 
differentials,  CD4+  counts,  orders  regarding  maximal  supportive  and 
resuscitative  care  ("Do  not  resuscitate"  and/or  "Do  not  intubate"  orders), 
identification  of  microbial  pathogens,  serum  lactate  dehydrogenase  levels, 
number  of  procedural  interventions,  location  of  treatment  prior  to  admission 
to  the  MICU,  and  chest  radiographic  findings. 
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All  information  necessary  for  APACHE  II  scoring  was  collected  as 
prescribed  by  Knaus  et.  al.  except  that  we  recorded  the  most  abnormal  value  of 
any  physiologic  variable  beginning  two  hours  prior  to  MICU  admission  and 
ending  22  hours  after  MICU  admission.  This  small  change  was  instituted  in 
order  to  account  for  the  previously  reported  problem  of  lead-time  bias76 
resulting  from  intensive  care  level  services  (intubation,  administration  of 
intravenous  cardiac  inotropes,  etc.)  being  delivered  before  the  patient  was 
physically  within  the  MICU  (in  other  words,  while  in  the  ambulance,  in  the 
emergency  room,  or  on  the  floor).  The  only  time  that  this  did  not  apply  was 
if  the  pafient  died  within  the  first  22  hours  within  the  MICU.  In  this  case,  the 
earliest  (though  not  prior  to  the  decision  to  admit  the  patient  to  the  MICU) 
recorded  variables  (not  the  most  abnormal),  were  used  for  the  calculation  of 
the  APACHE  II  acute  physiology  score.  (Otherwise,  heart  rate,  blood  pressure, 
respiration  rate,  and  Glascow  coma  scale  would  need  to  be  recorded  as  zero, 
thus  overestimating  severity  of  illness.)  Based  on  past  reports,77'7^  and  in  the 
method  of  Knaus  et.  al.,79  if  any  acute  physiologic  variable  recordings  were 
not  available,  it  was  assumed  that  the  variable  fell  with  in  the  normal  range. 

Although  APACHE  II  was  developed  as  a  prospective  scoring  system, 
our  time  constraints  made  retrospective  chart  review  more  feasible  for  our 
study.  Thus,  our  data  collection  differed  from  the  manner  used  by  Knaus  et 
al.  during  the  validation  study  for  APACHE  II.  However,  in  order  to 
maintain  agreement  with  the  validation  study,  we  only  recorded  data  which 
was  prospectively  available.  This  was  done  by  relying  on  information 
provided  by  the  patient's  bedside  clinicians  as  the  primary  data  for  diagnostic 
category  assignment.  However,  in  event  that  the  patient's  physician  did  not 
clearly  indicate  a  primary  diagnosis,  we  would  assign  a  diagnostic  category 
based  on  prospectively  available  information  (information  which  was 
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available  two  hours  prior  to  or  within  the  first  twenty-two  hours  of  MICU 
admission)  and  assisted  by  decisionmaking  algorithms. 

The  circumstance  in  which  clinicians  did  not  specify  a  primary 
diagnosis  was  not  unusual.  Typical  of  these  cases  were  patients  being 
admitted  with  such  diagnoses  as  "hypotension,  fever,  short  of  breath",  or 
"questionable  pneumonia,  rule  out  sepsis,"  or  "dyspnea,  fever,  rule  out 
congestive  heart  failure  versus  pneumonia,"  or  "abdominal  pain,  fever, 
questionable  sepsis  versus  gastrointestinal  perforation."  For  these  situations, 
prospectively  available  (available  to  the  original  clinician)  clinical 
information  such  as  previous  sputum  and  blood  culture  results,  concurrent 
Swan-Ganz  catheter  readings,  and  the  presence  or  absence  of  free  air  on  an 
upright  abdominal  X-ray  was  used.  This  is  not  to  say  that  the  actual  bedside 
clinician's  prospective  clinical  judgment  was  ever  ignored.  We  assigned 
diagnostic  categories  only  if  the  initial  clinical  assignment  of  diagnosis  was 
unclear  or  ambivalent. 

In  instances  of  similar  and  frequent  clinical  situations,  in  order  to 
facilitate  the  consistent  assignment  of  diagnostic  categories,  guidelines  were 
established  (as  suggested  by  Knaus  et.  al.)80  so  as  to  eliminate  variation  in  our 
retrospective  decisionmaking  process.  This  was  done  to  clarify  three  clinical 
scenarios.  One  involved  the  assignment  of  a  diagnostic  category  to  seizure 
patients  being  admitted  to  the  MICU.  If  the  patient  was  intubated  and 
mechanically  ventilated  in  the  MICU  (in  the  setting  of  a  prior  or  ongoing 
seizure),  the  patient  was  categorized  as  a  post  respiratory  arrest  patient.  If  the 
patient  was  not  intubated  due  to  the  prior  seizure,  though  still  transported  to 
the  MICU  for  observation  and  monitoring  of  his  or  her  neurologic  and 
respiratory  status,  then  the  patient  was  categorized  as  a  seizure  patient. 
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A  second  situation  in  which  decisionmaking  criteria  were  established 
involved  the  distinction  between  the  diagnostic  categories  of  "respiratory 
infection"  and  "sepsis."  (In  this  study,  every  patient  who  had  a  "respiratory 
infection"  had  pneumonia.  Thus,  for  the  remainder  of  this  report,  this 
diagnostic  category  will  be  referred  to  as  "pneumonia.")  In  this  situation,  as 
usual,  the  patients'  prospective  clinical  diagnosis  was  considered  first  and 
foremost.  If  a  question  still  remained  as  to  the  correct  diagnostic 
categorization,  prospectively-available  lab  results  were  next  considered,  and 
in  instances  of  continuing  uncertainty,  systolic  blood  pressure  (SBP)  and 
alveolar-arterial  oxygenation  gradient  (AaPCb)  was  examined.  In  general, 
previously  positive  blood  cultures,  a  low  SBP  (<  90  mm.  Hg.),  and  low  AaPCb 
were  factors  increasing  the  probability  that  the  patient  would  be  categorized  as 
septic.  Conversely,  negative  blood  cultures,  a  SBP  >  90,  and  a  high  AaPCb  led 
more  frequently  to  a  disease  categorization  of  pneumonia. 

A  third  situation  in  which  a  guideline  was  established  in  order  to 
reduce  inconsistency  involved  patients  who  had  had  emergency  life  support 
measures  taken  in  their  behalf  (patients  who  had  been  "coded").  When  faced 
with  the  decision  of  whether  to  classify  a  patient  who  had  a  respiratory  arrest 
which  then  progressed  to  a  cardiac  arrest,  it  was  decided  that  any  patient  who 
received  intravenous  cardiac  inotropes,  chronotropes,  or  mechanical 
stimulation  (chest  compressions)  would  be  categorized  as  a  post  cardiac  arrest 
patient.  If  the  patient  had  a  respiratory  arrest  (was  subsequently  intubated  and 
ventilated)  yet  did  not  require  nor  receive  cardiac  inotropes  he  or  she  would 
be  categorized  as  a  respiratory  arrest  patient. 

In  order  to  consistently  evaluate  and  determine  the  chronic  health 
status  of  individual  patients,  we  established  a  guideline  for  immunologic 
status  assessment.  This  was  done  because  it  was  felt  that  not  all  HIV-positive 
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patients  admitted  to  the  MICU  were  significantly  immunocompromised  nor 
would  their  HIV  seropositivity  greatly  effect  their  survival  from  their  acute 
illness  and  hospitalization.  Examples  of  this  situation  are  numerous:  a 
recently  diagnosed  and  asymptomatic  HIV-positive  patient  admitted  to  the 
MICU  because  of  a  non-immunologically  related  illness,  such  as  an  asthmatic 
crisis,  a  self-administered  drug  overdose,  or  trauma.  The  guidelines  are  as 
follows:  The  patient  was  felt  to  be  immunologically  compromised  (and  given 
points  for  a  chronic  disease  history)  if  (1)  he/she  were  HIV-positive  and  had  a 
history  of  Pneumocystis  carinii  pneumonia  (PCP),  Kaposi's  sarcoma  (KS), 
cytomegalovirus  (CMV)  retinits,  cryptococcal  meningitis,  or  any  other  AIDS- 
defining  disease,  (2)  he/she  were  HIV-positive  for  24  months  or  longer,  (3) 
he/she  were  diagnosed  as  HIV-positive  at  least  one  month  prior  to  hospital 
admission,  and  he/she  were  admitted  to  the  hospital  or  MICU  due  to 
infection,  lymphoma,  cachexia,  or  an  immunologically  mediated  illness. 
Conversely,  if  the  patient  were  diagnosed  as  HIV-positive  for  less  than  24 
months,  without  a  history  of  an  AIDS-defining  illness,  and  were  admitted  to 
the  hospital  with  a  non-immunologically  mediated  illness,  then  he/she  was 
felt  to  be  non-immunocompromised  and  therefore  did  not  receive  any 
chronic  health  points  (unless  for  other  organ  system  failure).  Using  this 
"immunosuppression  guideline",  151  patients  were  given  "chronic  health 
points"  for  having  either  a  history  of  severe  organ  system  insufficiency  or 
immunosuppression  prior  to  hospital  admission. 


39 


Data  Analysis  and  Statistical  Methods 

Medical  record  information  was  collected  on  data  entry  sheets  and 
transferred  into  a  data  management  information  system  (Reflex®  and 
Quatrop^o®  software).  This  software  was  capable  of  performing  all  needed 
data  retrieval  and  sorting  functions.  Data  are  expressed  anci  analyzed  so  as  to 
facilitate  easy  comparison  with  previously  published  APACHE  II  data.  In 
general,  means  are  expressed  ±  the  standard  deviation  (SD).  Predicted 
mortality  risk  was  calculated  using  the  APACHE  II  multiple  logistic 
regression  equation.  The  significance  of  differences  among  groups  of 
continuous  variables  was  determined  by  the  unpaired  Student's  t-test  and  by 
Chi-square  tests  or  binomial  distributions  for  groups  of  dichotomous 
variables.  Statistical  significance  is  defined  as  p  <0.05,  unadjusted  for 
multiple  comparisons. 


ILL  RESULTS 


The  patient  population,  represented  by  the  161  patient  records  (which 
for  the  matter  of  discussing  patient  demographics  will  be  considered  to 
represent  161  patients)  consisted  of  43  women  and  118  men.  The  mean  age  of 
the  population  was  37  years  old  (S.D.=  8.7),  with  a  range  from  21  to  79  years 
old.  The  racial  mix  of  the  group  was  92  Blacks,  14  Hispanics,  54  Whites,  and  1 
Native  American  (Table  7).  The  vast  majority  (97%)  of  the  patients  were 
nonsurgical  admissions.  Most  patients  were  transferred  to  the  MICU  from 
either  the  emergency  room  or  from  another  hospital  floor  (Figure  4).  During 
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the  five  years  of  this  study,  the  number  of  HIV-positive  patients  who  were 
admitted  to  the  MICU  fluctuated  without  a  clear  relationship  to  the  total 
number  of  HIV-positive  patients  who  were  admitted  to  the  hospital. 

The  population's  HIV  risk  factors  were  summarized.  Of  the  total  161 
patients,  107  patients  (66%)  had  a  history  of  intravenous  drug  use  (IVDA),  34 
patients  (21%)  had  a  history  of  homosexual  activity,  9  patients  (6%)  had  a 
history  of  both  IVDA  and  homosexual  activity,  and  29  patients  (18%)  denied 
both  HIV  risk  factors  (Table  7).  Of  the  29  patients  without  either  risk  factor, 
many  had  become  HIV  infected  by  known  sources  or  by  known  risks:  as  a 
recipient  of  infected  blood,  or  through  sexual  contact  with  a  high  risk  partner 
(ie.,  a  known  IV  drug  user,  a  known  HIV-positive  partner,  or  a  prostitute). 

It  was  observed  that  the  risk  factors  associated  with  HIV  infection 
varied  according  to  patients'  sex  and  race.  A  history  of  IVDA  was  greatest  in 
black  males  (84%),  hispanic  males  (82%),  and  white  women  (90%). 
Homosexual  activity,  known  not  to  be  an  HIV  risk  factor  for  women,  was,  as 
expected,  confined  to  the  male  population.  Within  the  male  population, 
homosexual  activity  accounted  for  a  larger  degree  of  HIV  transmission  in  the 
whites  than  in  other  races  (White:  45%,  Black:  19%,  Hispanic:  18%)  (Table  7). 
IVDA  was  an  equal  factor  leading  to  HIV  infection  among  both  females  and 
males  (72%  vs.  71%). 

The  classification  of  the  population  according  to  disease  category 
showed  a  predominance  of  MICU  admissions  due  to  infection  (78  patients, 
48%),  either  in  the  form  of  pneumonia  (44  patients,  27%  of  the  total 
population),  septic  shock  (29  patients,  18%),  or  a  neurologic  infection  (5 
patients,  3%)  (Figure  5).  Other  not  uncommon  diagnoses  were 
gastrointestinal  bleeding  (15  patients),  cardiac  arrest  (9  patients),  self¬ 
intoxication  (7  patients),  respiratory  arrest  (6  patients),  seizure  (5  patients)  and 
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electrolyte/acid-base  disturbance  (5  patients).  The  remaining  50  patients  were 
assigned  one  of  the  other  thirty  diagnostic  categories.  Changes  in  disease 
presentation  over  time  were  not  statistically  significant  (data  not  shown). 

Regarding  initial  severity  of  illness,  the  population's  mean  APACHE  II 
score  was  20.0  (S.D.  =  8.63),  the  median  was  19,  the  range  was  from  1  to  45,  and 
the  distribution  of  scores  reached  its  zenith  between  15  and  19  (Figure  6).  The 
population's  severity  of  illness  (as  reflected  by  APACHE  II  scores)  over  the 
five  year  period  of  study  did  not  show  statistically  significant  changes  (data 
not  shown). 

Regarding  APACHE  II's  prognostic  capability,  APACHE  II  had  a  correct 
outcome  classification  rate  (with  a  mortality  prognosis  >  .50  implying 
predicted  death)  of  67.1%.  This  correct  classification  rate  was  not  markedly 
improved  using  a  wide  range  of  cutoff  points  (Table  8).  A  receiver  operator 
characteristics  (ROC)  curve  was  constructed  based  on  sensitivity  and 
specificity  values.  (Figure  7). 

When  the  APACHE  II  prognostic  scoring  system  was  applied  to  the 
entire  cohort  population,  APACHE  II  significantly  underestimated  the  in- 
hospital  mortality  rate  (35.5%  estimated  (E)  vs.  44.1%  observed  (O),  p  <  .025). 
However,  this  discrepancy  was  largely  explained  by  the  fact  that  APACHE  II 
underestimated  mortality  in  patients  with  pneumonia  (n=44)  (28.5%  (E)  vs. 
50.0%  (O),  p  <  .005)  (Table  9  and  Figure  8).  In  patients  without  pneumonia 
(n=117),  APACHE  II  correctly  estimated  mortality  (38.1%  (E)  vs.  41.9%  (O)).  In 
the  second  largest  disease  category,  sepsis  (n=29),  APACHE  II  slightly 
overestimated  mortality  in  patients  with  sepsis  though  this  was  not 
statistically  significant  (56.4%  (E)  vs.  41.3%  (O)). 

Because  Smith  et.  al.  reported  that  APACHE  II  underestimated 
mortality  in  patients  with  Pneumocystis  cnrinii  pneumonia  (PCP),59  we 
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investigated  whether  APACHE  II  underestimated  mortality  in  patients  with 
pneumonia  of  any  etiology  or  only  due  to  P.  cnrinii.  In  our  44  patients  with 
pneumonia,  the  most  common  organism  isolated  was  Pneumocystis  cnrinii :: 
19  patients  (43%)  were  identified  to  have  been  solely  infected  with  P.  cnrinii 
and  7  patients  had  polymicrobial  infections  with  P.  cnrinii  (Figure  9).  In  the 
other  18  patients  with  pneumonia,  11  patients  had  other  pathogens  identified, 
and  7  patients  never  had  a  pathogen  identified.  Common  pathogens,  beside 
P.  cnrinii,  were  Staphylococcus  nureus,  Streptococcus  pneumonine, 
cytomegnlovirus  (CMV),  and  Hnemophilus  influenzn. 

The  manner  in  which  the  respiratory  pathogen  was  identified  in  our 
population  was  variable,  but  often  the  definitive  diagnosis  was  made  by 
invasive  means  (bronchoaveolar  lavage,  direct  lung  biopsy,  or  tissue  sample 
at  autopsy).  Of  the  44  patients  with  pneumonia,  27  of  them  (61%)  had  a 
negative  initial  sputum  culture  and/or  stain  (Table  10  A).  Of  the  27  patients 
with  negative  sputum  cultures/stains,  20  later  had  positive  microbial 
identification  by  one  of  the  more  invasive  studies.  Consequently,  the 
sensitivity  of  sputum  culture  and  stain  in  our  study  population  was  46%  (17 
of  37)  and  the  negative  predictive  value  of  26%  (7  of  27).  Much  of  the  lack  of 
sensitivity  is  due  to  lack  of  sensitivity  concerning  P.  cnrinii  detection.  Of  the 
26  patients  with  documented  P.  carinii  infection,  only  6  had  P.  cnrinii 
identified  from  their  sputum  (17  were  identified  by  BAL,  and  3  were 
identified  by  tissue  sample,  either  at  autopsy  or  direct  lung  biopsy).  Thus,  the 
sensitivity  of  initial  sputum  stain  and  culture  in  identifying  P.  cnrinii  was 
only  23.1%  (Table  10  B). 

Because  infection  with  P.  carinii  was  the  most  prevalent  cause  of 
pneumonia  in  the  study  population,  estimated  and  observed  mortality  rates 
in  pneumonia  patients  with  and  without  P.  cnrinii  infection  were  compared. 
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APACHE  II  accurately  predicted  mortality  in  patients  with  PCP  (28.2%  (E)  vs. 
34.6%  (O))  but  significantly  underestimated  mortality  in  patients  with 
pneumonia  of  another  etiology  (28.9%  (E)  vs.  72.2%  (O),  p  <  .005)  (Table  11  A). 
Because  of  concern  that  these  results  were  influenced  by  the  uncertainty  that 
was  introduced  by  seven  patients  never  having  a  pathogen  identified,  we 
reanalyzed  the  results  separating  these  seven  patients  (Table  11  B).  This 
reanalysis  yielded  results  which  suggested  that  APACHE  II  underpredicted 
mortality  in  both  groups  of  patients  whose  pneumonia  was  not  caused  by  P. 
carinii  (those  with  another  pathogen  identified  and  those  who  never  had  a 
pathogen  identified). 

Initial  data  analysis  demonstrated  that  APACHE  II's  capacity  to  predict 
group  mortality  rates  was  correlated  with  patients'  initial  severity  of  illness 
(measured  by  APACHE  II  scores).  It  appeared  as  if  APACHE  II  accurately 
predicted  group  mortality  rates  in  patients  with  high  APACHE  II  scores  (>25), 
was  less  accurate  in  patients  with  high  mid-range  APACHE  II  scores  (20-24), 
was  inaccurate  in  patients  with  low  mid-range  APACHE  II  scores  (15-19),  and 
was  most  inaccurate  in  patients  with  low  APACHE  II  scores  (<14)  (Table  12 
and  Figure  10).  However,  after  patients  with  pneumonia  were  excluded,  it 
was  observed  that  the  initial  inaccuracy  in  the  distribution  of  the  lower 
APACHE  II  scores  (<14  and  15-19)  disappeared  (Table  13,  Figure  11).  However, 
in  patients  with  pneumonia,  the  distribution  of  APACHE  II  scores  was 
important.  APACHE  II  accurately  predicted  mortality  in  pneumonia  patients 
with  APACHE  II  scores  >  14.  In  pneumonia  patients  with  APACHE  II  scores  < 
14,  predicted  mortality  was  significantly  less  than  observed  mortality  (Table 
14,  Figure  12). 

Similar  to  the  inaccuracy  of  APACHE  II  in  predicting  mortality  in 
patients  with  pneumonia,  mean  APACHE  II  scores  proved  unable  to 


44 


differentiate  between  survivors  (S)  and  nonsurvivors  (NS)  with  pneumonia 
(18.9  (S)  vs.  21.5  (NS))  (Table  15).  In  all  other  patients,  survivors'  mean 
APACHE  II  score  was  significantly  lower  than  the  mean  score  in 
nonsurvivors  (17.3  (S)  vs.  26.3  (NS),  p  <  .005  ). 

During  the  evaluation  of  outcomes,  unanticipated  differences  in 
survival  were  observed  in  patients  with  pneumonia.  Patients  with  PCP  had  a 
significantly  greater  survival  rate  than  patients  with  pneumonia  of  another 
etiology  (65.4%  vs.  27.8%,  respectively,  p  <  .014)  (Table  16  A).  This  difference 
existed  despite  the  two  groups'  similar  initial  prognoses  (as  measured  by 
APACHE  II  scores)  and  their  need  for  respiratory  support  (intubation  and 
mechanical  ventilation)  (Table  17).  The  difference  in  survival  rates  was 
basically  unchanged  if  only  intubated  patients  were  considered  (63.6%  in 
intubated  patients  with  PCP  vs.  26.7%  in  intubated  patients  with  pneumonia 
of  another  etiology)  (data  not  shown). 

When  pneumonia  patients  who  never  had  an  organism  identified 
were  separated  into  their  own  group,  a  difference  in  survival  rates  remained: 
14.3%  survival  of  patients  with  pneumonia  without  microbial  identification, 
36.4%  survival  of  patients  with  non-P.  carinii  pneumonia,  and  65.4% 
survival  of  patients  with  PCP.  The  difference  in  mortality  rates  was 
statistically  significant  between  patients  with  PCP  and  patients  with 
pneumonia  of  unknown  etiology.  The  difference  in  mortality  between 
patients  with  PCP  and  patients  with  pneumonia  caused  by  another  known 
pathogen  was  not  statistically  significant  (though  perhaps  still  clinically 
significant)  (Table  16  B  and  16  C). 

The  degree  of  immunosuppression  was  investigated  to  determine  its 
relationship  to  medical  outcomes.  Initially  CD4+  T-cell  count  were  recorded. 
However,  since  only  34  of  161  patients'  charts  contained  these  values,  we 
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decided  to  use  the  total  lymphocyte  count  (TLC)  as  our  marker  of 
immunosuppression.  This  was  in  accordance  with  a  report  that  TLC  is  an 
accurate  measure  of  CD4+  counts.81  TLC  was  calculated  for  each  patient  using 
the  following  formula:  TLC  =  #  of  White  blood  cells  (WBC's)  x  %  of  WBC's 
which  were  lymphocytes  (lymphocyte  differential).  In  the  total  cohort  of  148 
patients  with  adequate  information,  a  TLC  <  200  was  significantly  associated 
with  increased  mortality  (p  =  .003)  (Table  18  A).  Because  it  was  postulated  that 
TLC  would  be  most  significantly  associated  with  mortality  rates  in  patients 
with  infection,  we  reanalyzed  the  data  separating  patients  with  either 
pneumonia  or  sepsis.  A  TLC  <  200  was  found  to  be  significantly  associated 
with  increased  mortality  in  patients  with  infection  (p  =  .003)  but  not 
significantly  associated  with  outcome  in  all  other  patients  (without 
pneumonia  or  sepsis)  (p  =  .105)  (Table  18  B  and  18  C).  When  the  patients  with 
infection  were  separated  into  their  original  diagnostic  categories  (pneumonia 
or  sepsis),  it  was  observed  that  TLC  <  200  was  significantly  associated  with 
mortality  in  pneumonia  patients.  In  patients  with  sepsis,  although  TLC  <  200 
was  positively  correlated  with  increasing  mortality,  it  was  not  statistically 
significant  (Table  18  D  and  18  E). 

In  addition  to  assessing  the  effect  of  TLC  on  survival,  we  studied 
APACHE  IBs  predictive  accuracy  in  relation  to  the  location  of  treatment  prior 
to  admission  or  transfer  to  the  MICU.  Estimated  and  observed  mortality  were 
analyzed  for  patients  in  the  two  largest  patient  groups:  those  patients 
admitted  from  the  emergency  room  (n=81)  and  those  transferred  from  other 
hospital  floors  (n=72).  APACHE  II  accurately  estimated  mortality  in  patients 
admitted  from  the  emergency  room  (37.6%  (E)  vs.  44.4%  (O)),  but  significantly 
underestimated  mortality  in  patients  transferred  from  other  hospital  floors 
(33.3%  (E)  vs.  44.4%  (O),  p  <  .05)  (Table  19). 
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Because  AIDS  has  focused  so  much  attention  on  ICU  utilization  and 
cost  issues,  APACHE  II's  accuracy  relationship  to  patients'  length  of  stay  (LOS) 
in  the  MICU  was  evaluated.  The  cohort  had  a  mean  MICU  LOS  of  7.3  ±  15.0 
days  (±  SD).  In  respect  to  mortality,  it  was  found  that  APACHE  II  correctly 
estimated  mortality  in  the  144  patients  with  a  MICU  LOS  <  14  days  (34.9%  (E) 
vs.  40.3%  (O))  but  underestimated  mortality  in  the  17  patients  with  a  MICU 
LOS  >  14  days  (41.0%  (E)  vs.  82.4%  (O),  p  <  .005)  (Table  20). 


IV.  DISCUSSION 


This  study  was  undertaken  in  order  to  evaluate  the  APACHE  II 
prognostic  scoring  system  when  applied  to  an  HIV-positive  population 
admitted  to  the  MICU.  It  was  observed  that  APACHE  II  significantly 
underestimated  mortality  in  the  HIV-positive  population  to  which  it  was 
applied.  This  discrepancy,  however,  was  due  to  a  selective  underestimation 
of  mortality  among  patients  with  pneumonia.  In  patients  without 
pneumonia,  APACHE  II  accurately  predicted  mortality.  Interestingly, 
although  APACHE  II  underestimated  mortality  in  patients  with  respiratory 
infection  (pneumonia),  it  correctly  predicted  (in  fact,  slightly  overestimated) 
mortality  in  patients  with  systemic  infection  (sepsis). 

The  results  of  this  study  both  agree  with  and  conflict  with  results 
previously  reported  by  Smith  et.  al..  Both  studies  found  that  APACHE  II 
significantly  underestimated  mortality  in  HIV-positive  patients.  As  well, 
both  studies  found  that  mean  APACHE  II  scores  were  significantly  different 
between  survivors  and  nonsurvivors  in  patients  not  diagnosed  with 
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pneumonia.  However,  our  results  differ  from  those  reported  by  Smith  et.  al. 
in  the  explanation  given  for  APACHE  IPs  shortcomings.  Whereas  Smith  et. 
al.  reported  that  APACHE  IPs  inaccuracy  is  caused  by  underestimating 
mortality  in  patients  with  PCP,59  our  results  indicate  that  APACHE  IPs 
inaccuracy  in  HIV-positive  patients  is  due  to  an  underestimation  of  mortality 
in  patients  with  pneumonia,  particularly  in  patients  with  non-Pneumocystis 
carinii  pneumonia  (NPCP).  In  this  study,  APACHE  II  correctly  predicted 
mortality  among  patients  with  PCP. 

This  conflict  between  our  results  and  those  of  Smith  et.  al.  may  be 
related  to  the  difference  in  observed  PCP  mortality  rates  between  the  two 
studies.  Smith  et.  al.  studied  a  population  of  PCP  patients  with  a  mortality 
rate  of  86.5%  while  the  current  study's  PCP  patients  had  a  mortality  rate  of 
only  34.6%. 

The  lower  mortality  rate  of  our  study  is  likely  related  to  three  factors. 
First,  the  PCP  patients  in  this  study  were  not  as  acutely  ill  as  the  patients 
studied  by  Smith  et.  al.  (as  determined  by  differences  in  predicted  mortality 
rates,  28.2%  vs.  45.8%,  respectively).  This  may  be  due  to  differences  in 
protocol  between  Yale-New  Haven  Hospital  (YNHH)  (the  hospital  in  this 
study)  and  the  New  York  Veterans  Administration  Medical  Center 
(NYVAMC)  (the  hospital  in  the  study  by  Smith  et.  al.).  For  instance,  the 
threshold  of  illness  which  engendered  a  patient  to  be  transferred  to  the  MICU 
may  have  been  much  lower  at  YNHH  than  at  the  NYVAMC.  Therefore,  the 
patients  included  in  this  study  would  be  less  severely  ill  than  those  patients 
included  in  the  study  by  Smith  et.  al. 

Second,  because  the  patients  in  this  study  were  admitted  to  the  MICU 
later  in  the  "AIDS  epidemic,"  (October  1986  to  September  1991),  they  could  be 
expected  to  have  lower  mortality  rates  based  solely  on  the  trend  toward  better 
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outcomes  in  PCP  patients.  Whether  the  decreased  mortality  rates  are  due  to 
better  treatments  for  both  P.  carinii  infection  and  HIV  infection  (ie., 
pentamidine  and  AZT)  or  to  a  decreased  virulence  of  P.  carinii  is  not  known. 

And  third,  a  selection  bias  probably  exists.  Because  data  was 
retrospectively  collected  in  both  studies,  the  assignment  of  diagnostic 
categories  was  based  on  the  judgment  of  the  patients'  bedside  clinicians. 
However,  in  this  study  and  presumably  in  the  study  by  Smith  et.  al.,  clinicians 
often  would  not  clearly  site  a  specific  diagnosis.  In  this  study,  diagnoses  of 
"pneumonia,  rule  out  sepsis"  were  not  uncommon  in  the  admitting  orders  to 
the  MICU.  Thus,  the  data  collector  was  forced  to  make  a  decision  based  on  the 
available  information.  However,  in  such  decisionmaking  circumstances  as 
this,  APACHE  II  does  not  provide  data  collectors  with  detailed  guidelines 
concerning  the  classification  of  disease.  Hence,  the  assignment  of  diagnostic 
categories  was  influenced  by  the  judgment  of  the  data  collector. 

Consequently,  it  is  possible  that  the  assignment  of  patients  into  the 
disease  categories  of  "pneumonia"  and  "sepsis"  was  done  differently  during 
the  two  studies.  In  this  study,  out  of  161  patients,  44  were  classified  as 
"pneumonia"  and  29  were  classified  as  "sepsis."  In  the  study  by  Smith  et.  ah, 
out  of  83  patients,  54  were  classified  as  "pneumonia"  and  only  6  were 
classified  as  "sepsis."  Thus,  in  this  study,  the  ratio  of  patients  with 
pneumonia  to  those  with  sepsis  was  1.5  whereas  in  the  study  by  Smith  et.  al. 
the  ratio  was  9.0.  This  difference  (1.5  vs.  9.0)  in  the  ratio  of  patients  with 
pneumonia  to  those  with  sepsis  may  reflect  an  actual  difference  in  the  MICU 
populations  at  the  different  institutions.  However,  it  seems  more  likely  (as 
will  be  discussed  below)  that  this  difference  is  due  to  the  manner  in  which 
patients  were  assigned  to  diagnostic  categories.  Therefore,  the  observed  low 
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rate  of  mortality  in  PCP  patients  in  this  study  may  be  partially  due  to  the 
presence  of  selection  bias. 

Although  the  disagreement  between  this  study's  results  and  those  of 
Smith  et.  al.  may  partially  be  explained  by  the  difference  in  PCP  mortality 
rates  observed  during  the  two  studies,  the  difference  in  mortality  rates 
between  the  two  studies  does  not  explain  why  APACHE  II  underestimated 
mortality  in  all  patients  with  pneumonia  in  this  study.  The  possible 
explanation  for  the  underestimation  of  mortality  in  pneumonia  patients  is 
threefold.  First,  as  discussed  in  relation  to  PCP  mortality  rates,  there  is  likely 
to  be  a  selection  bias  affecting  all  patients  with  pneumonia.  Because  the 
APACHE  II  prognostic  scoring  system  does  not  provide  decisionmaking 
algorithms  concerning  the  assignment  of  diagnostic  categories,  individual 
data  collectors  must  make  these  decisions  on  their  own.  This  decision  enters 
into  the  calculation  of  predicted  mortality  using  the  APACHE  II  multiple 
logistic  regression  equation  (See  below).  Thus,  the  absence  of  decisionmaking 
algorithms  for  the  categorization  of  disease  directly  influences  the  predicted 
mortality  rate  and  the  prognostic  accuracy  of  the  APACHE  II  system. 


APACHE  II  multiple  logistic  regression  equation 

In  (R/l-R)=  -3.517+  .146  (APACHE  II  score)+  DCW  +  .603  {only  if 

emergency  postoperative}, 

R=Risk  of  death  and  DCW=  Diagnostic  category  weight 


The  lack  of  algorithms  is  especially  significant  for  patients  whose 
APACHE  II  scores  fall  in  the  middle  range  of  scores.  When  the  APACHE  II 
score  is  either  low  or  high,  the  relative  importance  of  diagnostic  categories  is 
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small.  Within  the  middle  range  of  scores,  however,  variations  in  disease 
classification  greatly  affect  calculated  risk  predictions.  (This  is  a  direct 
consequence  of  the  logistic  structure  of  the  regression  equation  and  has  been 
confirmed  by  Knaus  et.  al.  regarding  APACHE  III.45) 

The  conclusion  that  the  APACHE  II  predicted  morality  calculations  are 
subject  to  interobserver  variation  in  the  assignment  of  diagnostic  categories 
has  been  previously  reported.  Jackson  et.  al.  found  a  large  degree  of  inter¬ 
observer  variation  in  the  assignment  of  diagnostic  categories  for  a  small  test 
population.83  In  the  assessment  of  disease,  the  four  participants  in  the  study 
agreed  on  the  disease  categorization  in  only  four  of  the  ten  patients.  Concern 
about  interobserver  reliability  (and  intraobserver  reliability)  in  the 
determination  of  precipitating  factors  of  ICU  admission  has  also  been 
expressed  by  Teres  and  Lemeshow  42'84 

To  further  complicate  the  issue  of  categorizing  a  patient  with 
pneumonia  versus  one  with  sepsis  is  the  lack  of  consensus  regarding  the 
definition  of  sepsis.  Working  definitions  of  the  terms  "bacteremia," 
"septicemia,"  "sepsis,"  "sepsis  syndrome,"  and  "septic  shock"  have  been 
offered,85  but  it  is  unlikely  that  they  are  clinically  differentiated  in  the  same 
manner  at  separate  institutions.  Consequently,  individual  clinicians  and  data 
collectors  involved  in  one  study  are  not  likely  to  be  categorizing  patients' 
diseases  in  the  same  way  as  individuals  involved  in  a  similar  study  at 
another  institution. 

Thus,  APACHE  IPs  inaccuracy  in  predicting  outcome  in  pneumonia 
patients  is  likely  to  be  partially  due  to  an  absence  of  precisely  defined 
decisionmaking  algorithms,  and  the  resulting  interobserver  variation 
affecting  disease  categorization.  A  second  potential  cause  of  APACHE  II's 
underestimation  of  mortality  in  pneumonia  patients  is  an  inadequate 
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adjustment  for  the  severity  of  immunologic  suppression  by  HIV.  In  the 
comparison  of  estimated  and  observed  mortality  rates  in  pneumonia  patients 
(Table  14,  Figure  12),  it  is  apparent  that  APACHE  II's  greatest  degree  of 
inaccuracy  focuses  on  patients  with  relatively  low  APACHE  II  scores  and  a 
lesser  apparent  severity  of  acute  illness.  However,  it  is  also  clear  that 
APACHE  II  slightly  underestimates  mortality  across  the  entire  distribution  of 
APACHE  II  scores.  Unlike  the  results  of  Smith  et.  al.,  which  led  to  the 
conclusion  that  APACHE  II  does  not  adequately  adjust  for  the  severity  of  PCP, 
our  results  point  to  a  broader  deficit:  an  inadequate  adjustment  for  the  effects 
of  immunosuppression,  particularly  in  patients  with  pneumonia  with  low 
APACHE  II  scores. 

The  third  factor  which  can  potentially  explain  some  of  the  discrepancy 
between  estimated  and  observed  mortality  in  pneumonia  patients  is  the 
significant  number  of  patients  with  an  unknown  etiology  of  their 
pneumonia.  It  was  observed  that  APACHE  II  significantly  underestimated 
mortality  for  these  patients.  This  inaccuracy  is  likely  to  be  the  result  of  two 
things:  first,  patients  with  pneumonia  of  unknown  etiology  are  more  likely 
to  be  infected  with  CMV  or  cryptococcus  because  these  organisms  are  more 
difficult  to  identify  or  culture.  These  patients  are  also  known  to  be  more 
severely  immunosuppressed  than  patients  with  PCP.  Second,  the  empiric 
antibiotic  therapy  directed  at  an  unknown  organism  is  likely  to  be  less 
efficacious  than  specific  therapy  directed  at  a  known  pathogenic  agent.  The 
combination  of  these  factors  helps  to  explain  the  low  survival  rate  of  patients 
with  pneumonia  of  unknown  etiology  and  the  underestimation  of  mortality 
for  pneumonia  patients  as  a  whole. 

Thus,  APACHE  II's  underestimation  of  mortality  in  patients  with 
pneumonia  is  multifactorial.  It  is  likely  due  to  (1)  APACHE  II's  lack  of 
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decisionmaking  algorithms;  (2)  APACHE  II's  inadequate  risk  adjustment  for 
HIV  infection  in  patients  with  a  diagnosis  indicating  an  infectious  etiology; 
and  (3)  a  significant  proportion  of  pneumonia  patients  in  this  study 
population  without  a  known  etiologic  agent  for  the  cause  of  their 
pneumonia. 

After  research  on  the  current  study  was  begun,  three  more  reports 
concerning  APACHE  II's  application  to  patients  with  AIDS  were  published. 

In  1991,  Benson  et.  al.60  published  a  report  which  analyzed  retrospectively 
collected  data  from  72  hospitalized  patients  (not  necessarily  in  the  ICU)  with 
first-episode  PCP  admitted  between  April  1985  and  December  1987.  Only  12 
patients  required  mechanical  ventilation  and  the  mortality  attributable  to  PCP 
in  the  total  cohort  was  low  (8  deaths).  Four  parameters  (admission  serum 
albumin,  serum  LDH,  alveolar-arterial  oxygen  gradient  (AaP02),  and  the 
APACHE  II  score)  were  compared  for  their  ability  to  predict  mortality.  Serum 
albumin  levels,  LDH  levels,  and  APACHE  II  scores  were  significantly 
different  between  the  group  of  survivors  and  the  group  of  nonsurvivors.  In  a 
stepwise  discriminant  analysis,  only  APACHE  II  score  and  serum  LDH  levels 
were  statistically  associated  with  mortality. 

Benson's  conclusion,  namely  that  the  APACHE  II  score  is  one  of  the 
measures  which  was  significantly  associated  with  mortality  in  patients  with 
PCP,  is  in  frank  disagreement  with  the  results  previously  reported  by  Smith 
et.  al.  but  is  in  agreement  with  the  present  study's  conclusions.  This  is  likely 
a  function  of  either  one  or  both  of  the  following:  the  improved  survival  of 
patients  with  PCP  during  the  latter  1980’s,  and  the  initial  severity  of  acute 
illness  (as  described  by  the  predicted  mortality  rates  and  the  need  for 
mechanical  ventilation)  (Table  21). 
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Table  21:  Characteristics  of  Patients  with  PCI’  in  the  Different  Studies 

which  have  Applied  APACHE  II  to  AIDS 


Author 

Time  frame 
of  Stud  v 

%  needing  MV 

Predicted 
Mortality  (%) 

Observed 
Mortality  (%) 

Smith 

1 /84  -  9/88 

76%  (41  /54) 

44.3* 

86.5* 

Benson 

1/85-12/87 

16%  (12/75) 

14.4** 

16.0 

Current  study 

10/86-9/91 

85%  (22/26) 

28.2 

34.6 

This  value  is  based  only  on  those  37  patients  who  had  PCP  requiring  mechanical  ventilation. 
Predicted  and  observed  mortality  rates  for  patients  with  other  types  of  pneumonia  were  not 
reported. 

**  This  value  was  calculated  from  the  reported  mean  APACHE  11  scores. 

MV=  mechanical  ventilation 


Another  study  concerning  APACHE  II's  application  to  patients  with 
AIDS  was  presented  in  the  literature  in  the  form  of  an  abstract.61  jiva  et.  al. 
studied  76  AIDS  patients  with  successive  hospital  admissions.  In  1991,  they 
reported  that  there  was  a  positive  correlation  between  APACHE  II  scores  and 
mortality  in  AIDS  patients.  However,  it  was  noted  that  observed  mortality 
was  higher  at  every  APACHE  II  score  for  patients  with  AIDS  than  that 
reported  for  patients  without  AIDS.  Thus,  the  APACHE  II  prognostic  scoring 
system,  though  correlated  with  mortality  outcome,  underpredicted  mortality 
in  patients  with  AIDS.  Unfortunately  for  the  sake  of  comparison  with  the 
current  study,  the  percentage  of  the  study  population  that  was  categorized  as 
having  pneumonia  was  not  reported. 

The  final  study,  published  in  1992  by  Montaner  et.  al.,62  compared  the 
association  of  the  APACHE  II  score,  the  acute  lung  injury  (ALI)  score,  the 
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AIDS  Prognosis  score,  the  modified  multi-system  organ  failure  (MSOF)  score, 
and  serum  LDH  with  mortality  in  patients  with  PCP.  The  patient  population 
was  comprised  of  52  ICU  admissions  between  January  1985  and  April  1991.  47 
patients  (90%)  required  mechanical  ventilation,  and  the  mortality  of  this 
group  was  72%.  In  the  5  patients  who  did  not  receive  mechanical  ventilation, 
4  died  (80%).  The  results  of  the  study  indicate  that  the  MSOF  score  was  the 
only  independent  predictor  of  mortality,  though  there  was  a  positive 
correlation  between  mortality  and  both  MSOF  score  and  serum  LDH.  The 
AIDS  Prognostic  Staging  System,  the  ALI  score,  and  the  APACHE  II  prognostic 
scoring  system  were  found  to  be  poor  prognostic  tools  in  this  group  of 
patients  with  AIDS.  Montaner  et.  al.  conclude  that  "it  is  not  the  severity  of 
lung  injury  that  predicts  mortality  [in  patients  with  PCP],  but  rather  the  extent 
of  multisystem  organ  failure."  It  is  unclear  whether  this  conclusion  has 
validity  in  other  patient  groups.  It  appears  that  Montaner  et.  al.  believe  that 
survival  among  critically  ill  patients  is  dependent  not  only  upon  the  severity 
of  physiologic  abnormality  caused  by  dysfunction  in  a  primary  organ  system, 
but  also  upon  the  degree  of  other  organ  system  compromise  or  impairment. 

Although  Montaner's  results  concerning  the  prognostic  accuracy  of 
APACHE  II  when  applied  to  patients  with  PCP  conflict  with  the  present 
study's  results,  Montaner's  conclusion  appears  to  make  good  clinical  sense  in 
that  patients  with  mulitsystem  organ  failure  may  be  expected  to  have  a  high 
mortality  rate.86  Because  the  APACHE  II  prognostic  scoring  system  was 
designed  to  account  for  only  one  factor  precipitating  admission  to  the  ICU,  it 
is  systematically  unable  to  accurately  adjust  for  mortality  risk  in  patients  with 
multisystem  organ  failure.  It  can  be  argued  that  all  AIDS  patients  with 
pneumonia  requiring  mechanical  ventilation  and/or  intensive  care 
monitoring  have  at  least  two  organ  systems  in  failure:  respiratory  and 
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immunologic.  However,  APACHE  II  is  only  designed  to  account  for 
pneumonia  as  a  factor  necessitating  ICU  admission  whereas  it  accounts  for 
immunologic  insufficiency  in  Chronic  Health  Status.  Concern  for  the 
accuracy  of  APACHE  II  in  patients  with  multisystem  organ  failure  is 
especially  relevant  in  AIDS  patients  because  of  the  complexity  of  disease 
manifest  by  AIDS  patients.  Consequently,  patients  with  AIDS  may  have 
outcomes  (survival  versus  nonsurvival)  that  are  not  primarily  caused  by  the 
initial  disease  process  which  precipitated  MICU  admission. 

The  observation  that  multisystem  organ  failure  presents  difficulties  to 
the  APACHE  II  system  was  acknowledged  by  Knaus  and  his  colleagues  when 
they  stated  that  " [i] t  may  be  difficult  in  patients  with  multi-system  failure  to 
decide  which  system  failure  is  primary  and  which  is  secondary,  etc.  Your  best 
clinical  judgment  is  to  be  used  to  make  the  decision;  and  if  two  systems  are 
assessed  to  be  'equal'  in  failure,  one  system  can  be  'arbitrarily'  indicated  as 
primary."80 

Thus,  Montaner’s  results  imply  that  in  patients  with  pneumonia  and 
known  severe  immunosuppression,  APACHE  II  is  inaccurate  because  it  does 
not  adequately  account  for  multisystem  organ  failure  and,  thus,  should  be 
expected  to  underestimate  mortality  rate.  This  conclusion  is  somewhat 
supported  by  our  findings  that  TLC  <  200  is  strongly  associated  with 
increasing  mortality  in  patients  with  pneumonia.  In  addition,  this  finding 
may  help  explain  why,  in  this  study,  APACHE  II  so  greatly  underestimated 
mortality  in  patients  with  pneumonia  and  low  APACHE  II  scores.  Perhaps 
the  group  of  pneumonia  patients  with  low  APACHE  II  scores  who  did  not 
survive  hospitalization  were  actually  more  severely  immunocompromised 
(TLC  <  200)  than  those  who  survived.  Although  the  cause  of  this  discrepancy 


is  uncertain,  the  degree  of  "unaccounted"  immunosuppression  may  be  of 
explicative  value. 

Our  results  indicate  that  APACHE  II's  accuracy  in  predicting  mortality 
rates  varies  according  to  the  treatment  location  prior  to  admission  to  the 
MICU.  APACHE  II  accurately  estimated  mortality  in  patients  admitted  from 
the  emergency  room,  but  significantly  underestimated  mortality  in  patients 
transferred  from  other  hospital  floors.  This  confirms  the  results  previously 
reported  by  Escarce.66 

In  respect  ICU  utilization,  our  results  indicate  that  the  accuracy  of 
APACHE  II's  estimated  mortality  rates  was  dependent  on  the  length  of  stay 
(LOS)  of  patients  in  the  MICU.  APACHE  II  correctly  estimated  mortality  in 
patients  with  a  MICU  LOS  <  14  days  but  underestimated  mortality  in  patients 
with  a  MICU  LOS  >  14  days.  This  difference  between  mortality  rates  according 
to  length  of  stay  in  the  MICU  supports  the  finding  of  a  similar  difference 
recently  reported  concerning  very  old  patients  (>80  years)  who  were  being 
mechanically  ventilated  in  the  MICU.82 

Although  the  APACHE  II  system  has  received  its  share  of  criticism, 
many  of  the  weaknesses  of  APACHE  II  have  been  acknowledged  by  Knaus  et. 
ah,  and  aPACHE  II  was  subsequently  modified  to  incorporate  some  necessary 
changes.  The  result  of  these  changes  was  the  APACHE  III  prognostic  scoring 
system  which  was  validated  on  a  cohort  of  17,440  patients.45  The  APACHE  III 
prognostic  scoring  system,  as  expected,  is  based  on  the  same  conceptual 
approach  as  the  original  APACHE  system  and  APACHE  II.  However,  changes 
were  made  in  selecting  and  weighting  the  physiologic  variables  which 
determine  the  acute  physiology  score  (more  variables  were  chosen,  and  the 
weights  have  a  greater  range).  The  concern  for  lead-time  bias  in  terms  of 
location  and  duration  of  treatment  prior  to  ICU  admission  has  been 
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addressed.  The  number  of  diagnostic  categories  was  expanded  from  fifty-three 
to  seventy-eight,  allowing  for  more  precision  in  the  categorization  of  disease, 
and  the  diagnostic  category  weights  were  revised.  As  well,  the  determination 
of  physiologic  reserve  was  reworked.  The  age  assessment  has  been  re¬ 
weighted  and  the  list  of  significant  comorbid  conditions  has  been 
substantially  redefined  (only  those  comorbid  conditions  which  influenced 
patients'  immunologic  status  were  found  to  be  significant  upon  statistical 
analysis). 

At  present,  the  published  results  of  APACHE  III  indicate  that  its  overall 
explanatory  power  is  greater  than  that  of  APACHE  II.  However,  in  relation  to 
our  study,  its  accuracy  when  applied  to  an  HIV-positive  population  and 
specificcdly  to  HIV-positive  patients  with  non -Pneumocystis  carinii 
pneumonia  has  not  been  independently  validated.  This  is  one  area  which 
warrants  future  research,  particularly  in  relation  to  patients  with  pneumonia 
(either  PCP  or  NPCP).  As  well,  our  results  indicate  that  some  measure  of  the 
severity  of  immunosuppression  (if  available,  CD4+  T-cell  counts;  if  not 
available,  then  total  lymphocyte  count)  may  provide  added  prognostic 
accuracy  to  the  APACHE  II  (and  possibly  the  APACHE  III)  system.  This 
deserves  to  be  studied,  particularly  in  regard  to  HIV-positive  patients  with 
pneumonia  whose  APACHE  II  scores  fall  in  the  low  range  of  the  distribution. 

In  conclusion,  our  results  indicate  that  the  APACHE  II  prognostic 
scoring  system  is  not  an  accurate  method  of  stratifying  HIV-positive  MICU 
patients  with  pneumonia  (particularly  NPCP)  in  respect  to  in-hospital 
mortality.  When  applied  to  HIV-positive  patients  without  a  diagnosis  of 
pneumonia,  however,  APACHE  II  proved  to  be  an  accurate  method  of 
estimating  mortality  and  risk  adjustment.  The  reason  for  APACHE  IPs 
underestimation  of  mortality  is  multifactorial.  It  is  likely  to  be  due,  at  least 
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partially,  to  (1)  inadequate  adjustment  for  the  severity  of  immunologic 
suppression  in  patients  with  a  diagnosis  indicating  an  infectious  etiology  of 
their  illness;  (2)  lack  of  precisely  defined  decisionmaking  algorithms  to  aid  in 
the  assignment  of  disease  categories;  and  (3)  a  significant  proportion  of  HIV¬ 
positive  patients  with  pneumonia  of  unknown  etiology  who  may  be 
uniquely  resistant  to  treatment.  The  use  of  TLC  as  an  additional  risk  factor 
may  improve  predictive  accuracy  of  ICU  risk  adjustment  models  for  use  in 
HIV-positive  patients  with  diagnoses  of  infection. 


Eigure  3;  The  APACHE  II  Prognostic  Scoring  System 
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Figure  4:  Location  prior  to  MICU 


Figure  5:  Diseases  Precipitating  MICU  Admission 
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Figure  6:  Distribution  of  APACHE  SI  Scores 
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Figure  8:  Estimated  vs.  Observed  Mortality 
by  Disease  Category 
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Figure  9:  Pathogens  Causing  Respiratory  Infection 
in  HIV-Positive  Patients  Admitted  to  MICU 
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Figure  10:  Estimated  vs.  Observed  IVlortaS ity  Rates 
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Figure  11:  Estimated  vs.  Observed  Mortality  Rates, 

Excluding  Patients  with  Pneumonia 


Figure  12:  Estimated  vs.  Observed  Mortality  Rates 

in  Patients  with  Pneumonia 
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Table  7:  Population  Demographics:  Sex,  Race,  and  HIV  Risk  Factors 


HIV  Risk  Men 

Factor  _ (N=118) _ 

B  H  W  NA 


Women 

(N=43) 

B  H  W 


IVDA*  (N=98)  45  8 

Homosexual 

Activity**  (N=25)  5  1 

Both+  (N=9)  7  1 

Neither"*’"*’  (N=29)  5  1 


13  1 

19  0 

1  0 

11  0 


20  2  9 

0  0  0 

0  0  0 

10  1  1 


History  of  intravenous  drug  abuse,  no  history  of  homosexual  activity 
History  of  homosexual  activity,  no  history  of  IVDA 
’*’  History  of  both  IVDA  and  homosexual  activity 
"*”*"  History  of  neither  IVDA  nor  homosexual  activity 

B=  Black 
H=  Hispanic 
W=  White 

NA=  Native  American 


Table  8:  APACHE  II's  Accuracy  in  Outcome  Prediction  and  Statistical  Power 


Cutoff  Point 

Sensitivity* 

Specificity* 

PPV* 

NPV* 

%  Co 

>  .10 

94.4 

24.4 

49.6 

84.6 

55.2 

>  .20 

80.3 

46.6 

54.3 

75.0 

61.4 

>  .30 

66.2 

70.0 

63.5 

72.4 

66.4 

>  .35 

57.7 

77.8 

67.2 

70.0 

68.9 

>  .40 

53.5 

80.0 

67.8 

68.6 

68.3 

>  .45 

53.5 

81.1 

69.1 

68.9 

68.9 

>  .50 

43.7 

85.5 

70.5 

65.8 

67.1 

>  .60 

36.6 

92.2 

78.8 

64.8 

67.7 

>  .70 

26.8 

93.3 

76.0 

61.7 

64.0 

>.80 

18.3 

98.9 

92.9 

60.5 

63.4 

>  .90 

9.9 

100 

100 

57.0 

58.3 

All  numbers  are  percentages 


Table  9:  Estimated  vs.  Observed  Mortality  by  Disease  Category 


Disease  Cateeorv 

N 

Estimated 

Mortality  (%) 

Observed 

Mortality  (%) 

X2 

p  value 
(d.f.=  1) 

All 

161 

35.5 

44.1 

5.19 

<  .025 

Pneumonia 

44 

28.5 

50.0 

9.98 

<  .005 

Not  Pneumonia 

117 

38.1 

41.9 

.709 

NS 

Sepsis 

29 

56.4 

41.3 

2.67 

NS 

Pneumonia  or  Sepsis 

73 

39.6 

46.6 

1.48 

NS 

Neither  Pneumonia 
nor  Sepsis 

88 

32.2 

37.5 

1.13 

NS 

Table  10  A:  Accuracy  of  Sputum  Stain  and  Culture  for  the  Identification  of 
Pathogens  in  HIV-Positive  Patients  with  Pneumonia 


Sputum 
test  results 


Pathogen  No  pathogen 

identified  ever  identified 


Positive 


17* 


0 


Negative 


20 


7 


*  A- 


Sensitivity=  17/37=  46% 

Negative  Predictive  Value=  7 / 27=  26% 
False  Negative  Rate=  20/27=  74% 


For  the  calculation  of  sensitivity,  all  positive  sputum  cultures/ stains  are  assumed  to  be 
accurate. 

**  For  the  calculation  of  negative  predictive  value,  patients  who  never  had  a  pathogen 
identified  were  assumed  to  not  be  microbial ly  infected. 


Table  10  B:  Accuracy  of  Sputum  Stain  and  Culture  for  the  Identification  of 
Pneumocystis  carinii  in  HIV-Positive  Patients  with  Pneumonia 


Sputum 
test  results 


Documented* 
P.  car  in  ii 


Positive 


6 


Negative 


20 


Sensitivity=  6/26=  23% 


P.  carinii,  if  not  documented  by  sputum  sample,  was  later  identified  by  bronchoalveolar 
lavage,  open  lung  biopsy,  or  at  autopsy. 


Table  11  A:  Estimated  vs.  Observed  Mortality  for  Patients  with  Pneumonia 


Type  of 
Pneumonia 

N 

Estimated 
Mortality  (%) 

Observed 
Mortality  (%) 

X2  p  value 

PCP 

26 

28.2 

34.6 

0.53 

NS 

NPCP 

18 

28.9 

72.2 

16.45 

<  .005 

PCP=  Pneumocystis  cnrinii  pneumonia 
NPCP=  Non -Pneumocystis  carinii  pneumonia 


Table  11  B:  Estimated  vs.  Observed  Mortality  for  Patients  with  Pneumonia 

Type  of  Estimated  Observed 


Pneumonia 

N 

Mortality  (%) 

Mortality  (%) 

IX 

N> 

p  value 

PCP 

■26 

28.2 

34.6 

0.53 

NS 

Other  pathogen 

11 

23.2 

63.6 

_ * 

=  .005 

No  pathogen 
identified 

7 

37.8 

85.7 

_ * 

=  .014 

PCP=  Pneumocystis  cnrinii  pneumonia 

*  Signifies  that  statistical  significance  was  calcutated  using  a  binomial 
distribution  (1-tailed) 


Table  12:  Estimated  vs.  Observed  Mortality  by  APACHE  II  Scores 


Estimated  Observed 


APACHE  II  Score 

N 

Mortality  (%) 

Mortality  (%) 

x2 

p  value 

All 

161 

35.5 

44.1 

5.19 

<  .025 

<  14 

45 

11.2 

24.4 

7.94 

<  .005 

15-19 

43 

24.2 

37.2 

3.97 

<  .05 

20  -  24 

28 

39.1 

46.4 

0.63 

NS 

>25 

45 

68.5 

68.9 

.003 

NS 

Table  13:  Estimated  vs.  Observed  Mortality  by  APACHE  II  Scores,  Excluding 

Patients  with  Pneumonia 


Estimated 

Observed 

p  value 

APACHE  II  Score 

N  Mortality  (%) 

Mortality  (%) 

X2  (d.f.=  1) 

All 

117  38.2 

41.9 

0.67  NS 

<  14 

30 

10.0 

10.0 

0.00 

NS 

15-19 

25 

21.3 

36.0 

3.28 

NS 

20-24 

21 

37.1 

42.9 

0.29 

NS 

>  25 

41 

69.6 

68.3 

0.03 

NS 

Table  14:  Estimated  vs.  Observed  Mortality  by  APACHE  II  Scores  in  Patients 

with  Pneumonia 


Estimated  Observed 


APACHE  II  Score 

N 

Mortality  (%) 

Mortality  (%) 

x2 

p  value 

All 

44 

28.5 

50.0 

9.98 

<  .005 

<  14 

15 

13.6 

53.3 

_ * 

=  .0002 

15-19 

18 

28.2 

38.9 

1.02 

NS 

20-24 

7 

45.1 

57.1 

_ * 

NS 

>  25 

4 

57.2 

75.0 

_ * 

NS 

*  Signifies  that  statistical  significance  was  calculated  using  a  binomial 
distribution  (1-tailed) 


Table  15:  APACHE  II  Scores  for  Survivors  and  Nonsurvivors 


Average  APACHE  II  Scores  ±  S.D. 

Disease  Category 

Survivors  (N) 

Nonsurvivors  (N) 

p  value 

Pneumonia 

16.5  ±4.5  (22) 

17.5  ±5.9  (22) 

NS 

Not  Pneumonia 

17.3  ±6.9  (68) 

26.3  ±10.3  (49) 

<  .005 

Table  16  A:  Mortality  in  Patients  with  Pneumonia 


Survivors 

Non-Survivors 

Observed 
Survival  (%) 

PCP 

17 

9 

65.4 

NPCP 

5 

13 

27.8 

X2  =  6.018  p  =  0.014 

Table  16  B:  Mortality  in  Patients  with  PCP  vs.  Pneumonia  of  Unknown 

Etiology 


Observed 


Survivors 

Non-Survivors 

Survival  (%) 

PCP 

17 

9 

65.4 

Unknown  etiology 

1 

6 

14.3 

Fischer  Exact  test  (2  tailed);  p  =  .030 


Table  16  C:  Mortality  in  Patients  with  Pneumonia,  Excluding  Patients 

without  a  Known  Etiology  of  the  Pneumonia 


Observed 

Survivors  Non-Survivors  Survival  (%) 
PCP  17  9  65.4 

Other  identified  pathogen  4  7  36.4 


Fischer  Exact  test  (2  tailed);  p  =  .151 


PCP=  Pneumocystis  carinii  pneumonia 
NPCP=  Non-P neumocystis  carinii  pneumonia 


Table  17:  Severity  of  Illness  in  Patients  with  PCP  vs.  Non-PCP  Pneumonia 


PCP  (n=26) 

NPCP  (n=18) 

p  value 

Requiring 

Intubation  (%) 

85 

83 

NS 

Mean  APACHE  II 
Score 

16.9  ±5.0 

17.1  ±5.5 

NS 

Predicted 

Mortality  (%) 

28.2  ±  13.8 

28.9  ±  15.4 

NS 

PCP  -  Pneumocystis  carinii  pneumonia 
NPCP  =Non -Pneumocystis  carinii  pneumonia 


Table  18:  Association  between  Total  Lymphocyte  Count  (TLC)*  and  Mortality 


TLC  #  of  Survivors  #  of  Non-survivors 


<  200 

14 

jf. 

A.  Total  Population 

22 

Chi-square  =  8.96 

>201 

75 

37 

p  =  .003 

<200 

2 

B.  Pneumonia  or  Sepsis 

12 

Chi-square  =  9.07 

>201 

31 

21 

p  =  .003 

<200 

12 

C.  Not  Pneumonia  nor  Sepsis 

10 

Chi-square  =  2.62 

>201 

44 

16 

p  =  .105 

<200 

1 

D.  Pneumonia 

7 

Fischer  Exact  (2-tailed) 

>201 

19 

13 

p  =  .048 

<  200 

1 

E.  Sepsis 

5 

Fischer  Exact  (2-tailed) 

>201 

12 

8 

p  =  .163 

TLC  was  calculated  using  the  formula  TLC  =  #  of  WBC's  x  lymphocyte  differential  (% 
WBC's  which  were  lymphocytes). 

3(* 

Of  the  total  population  of  161  patients,  148  (91.9%)  had  the  necessary  information  in  their 
charts  for  the  calculation  of  TLC. 


Table  19:  Estimated  vs.  Observed  Mortality  by  Location  Prior  to  MICU 


Estimated 
Mortality  (%) 

Observed 
Mortality  (%) 

X- 

p  value 

Emergency  Room  (n=81) 

37.6 

44.4 

1.62 

NS 

Hospital  Floor  (n=72) 

33.3 

44.4 

4.02 

<  .05 

Table  20:  Estimated  vs.  Observed  Mortality  by  MICU  Length  of  Stay 


Estimated 

Observed 

Mortality  (%) 

Mortality  (%) 

X2  p  value 

MICU  LOS  <  14  days  (n=  144) 

34.9 

40.3 

1.83  NS 

MICU  LOS  >  14  days  (n=l  7) 

41.0 

82.4 

12.02  <  .005 
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